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ABSTRACT 

Typical end-of- course faculty ratings were questioned 
for their inability to measure actual classroom interaction. 

Extending the concept of these evaluations to include the student 
instructional expectations dimension, the study proposed that the 
classroom experience be related to the process and systems 
approaches, more dependent upon monitoring and feedback. As a working 
model for classroom role structure, it employed Stogdill's group 
achievement framework to distinguish between those expectations 
associated with institutional dynamics — called situational — and those 
related to the classroom — called contextual, instruments were 
administered at the opening — and close — of a semester to 209 students 
in 8 social science classes in 3 colleges. Developed from previous 
Critical Incident and factor analysis techniques, the 20 instrument 
items employed modified Semantic Differential scales. Additionally, 
the study examined relationships between expectations and the AVL 
(Allport-Vernon-Lindzey) Value Scale. The substantive hypotheses — for 
each class and demographic variable — found statistically significant 
differences between expectations and evaluations, even for high 
evaluations. If these evaluations demonstrated the direction of 
expectations, but failing to indicate their intensity, the study 
concluded that extrapolations made solely from evaluations were of 
questionable value in measuring the range of classroom dynamics. AVL 
intercorrelations did not produce any significant patterns, however. 
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INTRODUCTION 



This research project began largely by chance and cir- 
cumstance. About three years ago a group of students active in winning 
acceptance for a student academic affairs committee at our college, 
asked me to help construct an end- of- course evaluation instrument. 

In the few hours allotted me by those busy and involved students, an 
instrument was prepared. Its subsequent use failed to stir up the an- 
ticipated anxieties; some faculty even welcomed it. As I shared the 
usual curiosity regarding faculty evaluation, I began gathering infor- 
mation about the field and tried various home-grown experiments in 
self-evaluation. A search through the literature, however, proved 
disheartening. Despite the growing enthusiasm of students for faculty 
evaluation, most investigators seemed to lack concern with integrating 
their findings into a comprehensive conceptual framework. Moreover, 
some appeared to be unaware or unconcerned with growing student 
demand that effective teaching and effective learning no longer be 
treated as separate issues. . • 

In developing a research prospective, as described in this 
thesis, one basic question kept recurring: did course evaluations begin 
at the right juncture in the student-teacher relationship? Clearly, any 
realistic assessment of higher education, including evaluations research. 



demanded the inclusion of both teacher and student domains. 

In the broader sense this thesis developed at a time when 

critical perspectives regarding the student-teacher relationship are 

emerging. If this relationship served as a basic education unit, 

recent works maintained that higher education must reacquaint itself. 

with its importance. The work of Joseph Katz was most influential; 

he challenged investigators in undergraduate education to rethink their 

traditional models by accepting the ideal of the fullest possible develop- 
# * 

ment of the individual student. Katz and his associates at Stanford 
University went beyond Sanford’s earlier notion that curriculum must 
be a function of individual personality development, by concluding that 
this development must be reflected in all aspects of college experience. 
On a more polemical note, Harold Taylor voiced concern in sweeping 
terms:'* 

Learning and 1 ' teaching . . . have to do with the 
totality of human conduct, in which the conduct of affairs 
of the’ mind is by turns, political, social, public, private, 
intellectual, emotional, external, internal, and, in the 
last analysis, personal. Otherwise conduct has not 
meaning, the human act is stripped of motivation, empty 
of Content, lacking in truth. 

In reviewing recent works in higher education, Austin declared that 

. * 

although not all would completely support Taylor, educators and 



* Joseph Katz, et al . , Nq Time For Youth- (San Francisco: 
Jossey-Bass, Inc., 1968), p. 160. 

^Harold Taylor, Stud ents Without Teachers: The Crisis in 
the University (New York: McGraw-Hill Bpok Company, 1969), p. 321. 
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researchers agreed that "a relevant curriculum must give attention 

both to the growth of the person and the building of a better society, 

3 

neither of which can be sought in isolation from the other." 

Accepting this charge, Chickering advanced seven basic 

4 

developmental tasks associated with college experience. They were: 
achieving competence, managing emotions, becoming autonomous 
establishing identity, freeing interpersonal relations, Clarifying purposes, 
and developing integrity. Although Chickering offered suggestions to 
implement these developmental objectives, notably by increasing the 
meaningful interaction of instructor and student, similar expressions 

5 

characterizing the classroom as a community were proposed by Schwab. 
He envisioned the classroom relationship essentially as a collaborative 
one, one that will be proposed by this thesis. Schwab stated that if 
the classroom presented an obvious "visibility of roles," there must 

V. 

also be "an exchange of roles where exchange is proper — so that students 
discover what it is like to be a professor and professors rediscover 
what it is like to be a student. It is a human society--not a society of 

<r* 

disembodied minds; when professors confront students in the seminar 

% 

room, they do so as persons who can be annoyed. or irritated by students, 

• • . • 

^C. Grey Austin, "The Year's Books," Journal of Higher 
Education, 40 (November, 1969), 589-592. 

• ^Arthur W. Chickering, Education and Identity (San Francisco, 
Jossey-Bass, Inc. , 1969), pp. 9-19. 

5 Joseph J. Schwab, College Curriculum and Student Protest 
(Chicago: University of Chicago Press, 1969). 
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who can be impatient, tired, distracted, who are concerned for 
students as well as with them — and by this visible freedom, enable 
students to be similarly free." His emphasis was not on role pre- 
scription, but on role sharing. As Chickering's and Schwab's conceptions 
reflected changing pedagogical and organizational patterns in higher 
education, it became imperative not only to learn more about the behavior 
of student evaluations, but student expectations of instructors as well. 

If the concepts of expectations and evaluations received wide 

i * 

circulation in the behavioral and social sciences, both seemed to lack 

4 

ainy clear-cut or consistent definitions. For the purposes of this. study, 
however, working definitions may be borrowed from ordinary language. 
Although expectations related to instruction will be discussed later, it 
can be defined here as the degree of probable occurrence of something 
believed to be desirable. Evaluations would be the summing up or 
measuring of the relative worth of actual occurrences. . As used here, 
evaluations determined instructor effectiveness by employing rating 
instruments. 

This study was organized in the following way: the first 

chapter reviewed how previous evaluations studies contributed to the 

• • 

expectations- evaluations framework developed here. The second chapter 
explored the nature of instructional expectations. Chapter three outlined 

V 

^Joseph J. Schwab, College Curriculum and Student Protest 
(Chicago: University of Chicago Press, 1969), p. 230. 
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research procedures and offered several hypotheses. The fourth chap- 
ter presented the findings. And chapter five discussed conclusions and 
implications of the study. 
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CHAPTER I 



THE PROBLEM OF RELATING EXPECTATIONS TO EVALUATIONS 



The controversy over student evaluations of their instructors 
does not suffer from the lack of empirical evidence. A recent review 
of the literature, with its bibliography of almost 300 recent research 
items has indicated that student evaluations could become more accept- 
able as important aspects of college education. * Yet the state of current 
research does not mask the continued debate regarding student evaluators. 
In a recent panel discussion of educators a panelist asked: "The question, 
ladies and gentlemen, is not why evaluation, not whether evaluation, 
but how evaluation is to be accomplished?" This only draws attention 

to the basic questions addressed by previous student evaluations re- 

3 

search. The question could contribute, nonetheless, to a clearer 



^Kenneth E. Eble, The Recognition and Evaluation of 
Teaching , Report of the Project to Improve College Teaching, Salt 
Lake City, Utah, November, 1970 (Salt Lake City, Utah: Project 
to Improve College Teaching, 1970), pp. 101-111. • * 

2 

Walter Schoen, "Faculty Evaluation, Pro and Con: The 
Hypocrites,” Report of the Twenty-Second Annual Conference , 1969 
(Cazenova, New York: New York State Association of Junior Colleges, 
1969), p. 34. 

3 lbid., pp. 34-41. 
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understanding of the evaluative process by raising a prior question; 
namely, what is being measured by students in these instructed 
ratings ? The argument of this paper is that the ratings measure 
what students expect of the instructor. Hence, this paper focuses on 
college student role expectations of instructors. 

The Student as Evaluator 

In order to determine how this perspective developed, the 
rationale and findings of the evaluations field itself must be examined. 

• The most prominent form of evaluations research focused on 
end-of- course student ratings. Characteristically, these studies 
began with the premise that "as higher education is organized and 
operated, students are pretty much the only ones who observe and 
are in a position to judge the teacher's effectiveness. A recent 
writer added that, "at present the only regular observers of the 
teacher in action are his students."'* Generally, researchers ' 
correlated one intervening student variable such as sex or grades 

^H. H. Remmers, "On Students' Perceptions of Teachers' 
Effectiveness, " The Appraisal of Teaching In Large Universities , ed. 

W. J. McKeachie (Ann Arbor; University of Michigan Press, 1950), 20. 

5 

Paul H. Owen, "Some Dimensions of College Teaching: 

An Exploratory Study Using Critical Incidents and Factor Analysis of 
Student Ratings" (unpublished Doctoral dissertation. University of 
Houston, 1967), p. 11. 
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with end-of-term instructor ratings. Presumably, this would uncover 

whether students were unduly prejudiced in their ratings. A leading 

figure in the field since the 1920's, H. H. Remmers, who conducted 

a number of rating studies at Purdue University, summarized the 

field's major research findings up to 1959. Those related directly 

6 

to college teaching were: 

1. Grades of students are not in general closely related 
to the ratings of the teacher. 

2. There is evidence showing that there is little, if 
any, relationship existing between student ratings 
of teachers and the judged difficulty of the course. 

3. In a given institution there exists wide and important 
departmental differences in effectiveness of teaching 
as judged by student opinion. 

4. The sex of the student rater bears little or no 
relationship to the ratings of teachers. 

5. While the effect on student ratings of a generalized 
attitude (the 'halo effect') toward'the teacher has 
not been isolated, it apparently does not exist to 

an extent sufficient to invalidate the ratings of separate 
aspects of teaching methods and of the course. Evi- 
dence indicates that students discriminate reliably 
for different aspects of teacher's personality and of 
the course and between different instructors and 
courses. 

6. Teachers with less than five years experience tend 
to be rated lower than teachers with more than 
eight years experience. 

7. Mature alumni of ten years' standing agree substan- 

tially with on- campus students in their evaluation 
of teachers. * 

8. The year in school of the rater has no effect on 
the ratings given except that ratings by graduate 
students tend to be higher than those by undergraduates. 



R 

H. H. Remmers, "On Students’ Perceptions of Teachers' 
Effectiveness, " The Appraisal of Teaching In Large Universities , 
ed. W. J. McKeachie (Ann Arbor: University of Michigan Press, 
1959), 21-22. 
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He cited two other findings not yet fully studied in the college setting, 

that "a considerable number of those who have used student rating 

believe this procedure is useful for facilitating the educational 

process," and, knowledge of student opinions and attitudes leads 

to improvement of the teacher’s personality and educational pro- 

cedures." In conclusion, he noted: "No research has been pub- 

% 

lished invalidating the use of student opinion as one criterion of teacher 

O 

effectiveness." In fact, in proposing a rating instrument for general 

use, McKeachie restated most of the earlier findings and reemphasized 

g 

the reliability of student opinion in evaluation. 

Updating these conclusions, investigations in the last ten 
years 'generally followed the single- variable and end- of- course 
rating pattern. Widespread evidence reconfirmed that neither 
age, school- year level, grade-point average, sex or course grades 

n 

H. H. Remmers, "On Students’ Perceptions of Teachers’ 
Effectiveness," The Appraisal of Teaching In Large. Universities , ed. 

W. J. McKeachie (Ann Arbor: University of Michigan Press, 1959), 21. 

^ Ibid . , p. 22. 

m 

®W. J. McKeachie, "Student Ratings of Faculty, " AAUP 
Bulletin, 55 (December, 1969), 439-444. 



•f*y 



10 



10 

adversely affected ratings. One study at the California State 
College at Long Beach pointed out that even when students were 
actually cued to previous ratings of an instructor, they did not 
passively record their predecessors view. ^ As anticipated, a 
number of studies still focused on the influence expected^rades had 
on ratings. A study done at the University of California at Berkeley 

supported the proposition that the rating scores will not be strongly 

1 2 

biased by the leniency of the instructor in assigning grades. 

While another, at the University of Washington extended these 
conclusions: "High ratings cannot be 'bought' by giving .high grades, 

nor are they lost by giving low grades. Both when judging their 
instructor's over-all value as a teacher and when rating his skill 
in specific respects, such as clarity of presentation and development 
of interest the students rarely, if ever, were influenced by the grades 



^Nicholas F. Rayder, "College Student Ratings of 
Instructors," Journal of Experimental Education , 37 (Winter, 1968), 
76-81, and William E. Coffman, "Determining Students' Concepts 
of Effective Teaching from Their Ratings of Instructors," Journal 
of Educational Psychology, 45 ( May, 1954) , 277-286. • * 

^ James N. McClelland, "The Effect of Student Evaluations 
of College Instruction Upon Subsequent Evaluations, " California 
Journal of Educational Research, 31 (March, 1970), 88-95. 

12 

C.' M. Garevick and H. D. Carter, "Instructors Ratings 
and Expected Grades, " California Journal of Educational Research, 

13 (November, 1962), 218-221. 
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which they had received from that teacher."*** Seeking to determine 

the reliability of student ratings when different instructions were 

used on the same class, a study done at Brooklyn College reported 

"that students’ ratings of teachers . . . are not greatly influenced 

by the different administrations of the questionnaires."* * *- * Moreover, 

no significant pattern relating authoritarianism in students with 

either high or low ratings was found at a "large southern woman's 

college." An earlier study demonstrated that a panel of instructors 

plus administrators along with a random sample of students did not 

differ significantly in their assessment of the effectiveness of actual 

1 fi 

classroom situations. A recent study at Clemson University re- 
confirmed reports that "personal qualities of the teacher, sex of 
the student, grades in the class, and overall GPR / grade-point rating^/ 

***Virginia W. Voeks and G. M. French, "Are Student- Ratings 
of Teachers Affected by Grades?" Journal of Higher Education, 31 (June, 
1960), 330-334. 

* ^Manuel Cynamon and Shirley U. Wedeen, "Emotional 
Factors in Reliability of Student Ratings of Teachers, " Journal of 
Educational Research , 51 (April, 1958), 629-632. 

*®Ann C. Maney, "Authoritarianism Dimension in Student 
Evaluations of Faculty," Journal of Educational Sociology, 32 (January, 
1959), 226-231. 

1 fi 

Ruth E. Eckert, "Ways of Evaluating Teaching, " School 
and Society , 71 (February 4, 1950), 65-69. 
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did not influence student assessment of teacher performance." 

If this evidence established that "college students appear 
to have greater objectivity and less superficial value systems than we 
have realized,"^ another conclusion should be that the more re- 
searchers tried to uncover intervening variables influencing — that is, 
prejudicing — student choices, the more the findings discounted them. 
Few empirical studies, moreover, offered any substantial evidence 
disputing the concept of student end-of-term evaluations. 

Developing Traits of the Ideal Professor 



With the cumulative results of ratings studies giving 
impressive testimony to the judiciousness of students as evaluators, 
the second notable trend of evaluations research identified the image 
of the ideal professor. Concerned with eliciting general or overall 
characteristics of an ideal professor, these studies utilized both closed 
and open-ended instrument techniques. More recently, some studies 
examined whether differences associated with a student’s major field, 
grade-point average, sex, year in school, as in the ratings studies. 



17 

1 Bernard Caffrey, "Lack of Bias in Student Evaluations 
of Teachers, " Proceedings of the 77th Annual Convention of the 
American Psychological Association , Vol. IV, Pt. II, 1969 (Washington, 
D. C.: American Psychological Association, 1969), pp. 641-642. 

18 Virginia W. Voeks and G. M. French, "Are Student-Ratings 
of Teachers Affected by Grades?" Journal of .Higher Education , 31 
(June, 1960), 330-334. 
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influenced perceptions of the ideal professor. Despite certain 

differences a general pattern of the ideal professor emerged. 

The earliest, and still most systematic attempt was a study 

of an entire college undertaken by Riley and his associates at Brooklyn 

19 

College in 1950. Deriving a list of some twenty items from a pre- 
viously administered open-ended questionnaire, students were asked 

to rank them in order of importance as teaching attributes. The ten 

20 

items most often selected were the following: 



1. Knowledge of the subject. . J38% 

2. Attitude toward the subject .78% 

3. Organization of subject matter ..... .75% 

4. Attitude toward students 72% 

5. Personality of instructor 72% 

6. Speaking ability .72% 

7. Ability to explain 71% 

8. Tolerance to disagreement 71% 

9. Fairness in examinations ....60% 

10. Encouragement to thinking 55% 



A similar breakdown resulted from other studies employing 
open-ended techniques. Asking students to write a theme relating 
n the most effective professor they knew," Crawford and Bradshaw 



*9john W. Riley, Jr., Bryce F. Ryan, and Marcia Lifshitz, 

The Student Looks at His Teacher (New Brunswick, New Jersey: Rutgers 
University Press, 1950). The sample numbered more than 6, 000 students. 

20 Ibid. , p. 82. 

L. Crawford and H. L. Bradshaw, “Perception of 
Characteristics of Effective University Teachers: A Scaling Analysis," 
Educational and Psychological Measurement,' 28 (Winter, 1968), 1079-1085. 
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of Ohio University reported a list of behaviors unusually similar to 

22 

Riley's rank order: 

1. Has a thorough knowledge of subject matter plus sub- 
stantial knowledge in other fields. 

2. Lectures are well planned and organized. 

3. Is enthusiastic, energetic and has a lively interest 
in teaching. 

4. Is student- oriented; willing to help students out- 
side of class. 

5. Encourages student participation in class by 

questions and discussion. • . . 

6. Relates class material to other fields. 

7. Speaks clearly and distinctly. 

8. Lacks defensive attitudes and prejudices. 

9. Defines clearly the basis for evaluation of 
students' performances. 

10. Uses a variety of teaching devices, demonstrations, 
charts. 

In a study at Western Washington State College, Gadzella, likewise, 

examined the most and least prominent characteristics of college 

instructors. The five most important ones were: “knowledge of 

subject, interest in subject, flexibility, organization of daily and 

course preparations, and presentation of course materials." And 

the five least noted were: "publisher- writer, community participator, 

oo 

researcher, appearance and punctuality." Quick and Wolfe con- 
sidered the qualities students at the University of Oregon attribued 

% 

to the ideal college professor. The results again ’confirmed the , 



• go 

P. L. Crawford and H. L. Bradshaw, "Perception of 
Characteristics of Effective University Teachers: A Scaling 
Analysis, " -Educational and Psychological Measurement , - 28 (Winter, 
1968), 1083. 

^Bernadette M. Gadzella, "College .Students' Views and Ratings 
of an Ideal Professor, "College and University, 44 (Fall, 1968), 89-96 
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ideal as one who "(a) stimulates students to think independently rather 

than to memorize, (b) presents well organized course work and subject 

matter, (c) is genuinely enthusiastic about his subject, (d) explains 

material clearly, and (e) accepts and values student differences in 

opinion." In contrast, the three least important statements concerning 

the ideal professor described him as one who "(a) is scholarly as well 

* • 

as an active researcher, (b) has an adequate speaking voice, and, (c) 

.24 

likes and is interested in college age youth as individuals.'" These 

findings were supported by work done by Yamamoto and Dizney at 

Kent State University, noting additionally, no sex differences in the 

25 

items selected by the students. 

In spite of the students' relative lack of concern with research 

and writing in the ideal teacher images, two sociologists recently 

tried to determine whether student ratings could be, nonetheless, 

26 

correlated with actual instructor publications. They developed a 



*^Alan F. Quick and Arnold D. Wolfe, "The Ideal Professor," 
Improving College and University Teaching, 13 (Summer, 1965), 133-134. 

25 Kaoru Yamamoto and H. F. Dizney, " Eight Professors: 

A Study on College Students' Preferences Among Their Teachers," 
Journal of Educational Psychology, 57 (June, 1966), 146-150. 

2 ®William M. Stallings and Sushila Singhal, "Some Observations 
on the Relationships Between Research Productivity and Student 
Evaluations of Courses and Teaching, 11 The American Sociologist , 5 
(May, 1970), 141-143. . 
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"Research Productivity Index" tabulating faculty publications at two 

midwestern state universities and correlated it with student ratings. 

Their findings were inconclusive: "The bulk of the data lend some 

support to the position that 'publication is not associated with poor 

teaching. ' Conversely, these same data do not offer convincing 

27 

• proof that publication is related to good teaching." In other words, 
publishing per se was not a function of g<?od or bad teaching, as rated 
by students, at least. 

A study conducted at the University of Toledo proposed specific 

28 

items to be used in the development of valid ratings schedules. 

Employing a cross-section sample of students, faculty, and alumni, 

it "centered on identifying effective teaching behaviors and determining 

29 

their relative importance." An open-ended technique identified 
"effective teacher behaviors," and gathered responses from 812 
students, 166 faculty, and 665 alumni. A panel categorized the res- 
ponses into sixty criterion items which were rated on -a five-point 



27 

William M. Stallings and Sushila Singhal, "Some Observations 
on the Relationships Between Research Productivity and Student Evalua- 
tions of Courses and Teaching," The American Sociologist, 5 (May, 1970), 
142. 

. • 

^Richard R. Perry, "Criteria of Effective Teaching Behavior 
in an Institution of Higher Education, " Proceedings of the Seventh 
Annual Forum of the Association for Institutional Research (n. p.: The 
Association for Institutional Research, 1967), pp.49-59. 

29 Ibid. , p. 5). 






scale by another cross-section sample. Significantly, despite the 



spread of sixty items, compared usually with twenty, the first and 
last ten items virtually parallelled those of .other ideal professor 



studies involving student samples only. The most important items 
identified were:^ 



1. Being well prepared for class 

2. Establishing student interest in the subject being 
taught 

3. Demonstrating comprehensive knowledge of his 
subject 

4. Using teaching methods which enable students to 
achieve objectives of the course 

5. Constructing tests which search for understanding on 
the part of the students rather than rote memory ability 

6. Being fair and reasonable to students in evaluation 
procedures 

7. Communicating effectively at levels appropriate to 
the preparedness of students 

8. Encouraging intelligent independent thought by 
students 

9. Organizing the course in logical fashion 

10. Motivating students to do their best 

The least important items of the sixty Were: 

51. Beginning and ending classes on time 

52. Being neatly dressed 

53. Being knowledgeable about the community in which 
he lives 

54. Having irritating personal mannerisms 

55. Involving himself in appropriate university 

committees . * 

56. Holding membership in scholarly organizations 

57. Being consistently involved in research projects 



^Richard R. Perry, "Criteria of Effective Teaching 
Behavior in an Institution of Higher Education, 11 Proceedings of the 
Seventh An nual Forum of the Association for Institutional Research 
(n. p. : The Association for Institutional Research, 1967), pp. 57-58 
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58. Devoting time to student activities on campus 

59. Making appearances which assist programs of 
community organizations 

• 60. Publishing material related to his subject field 
Although the Toledo study aimed specifically at validating items for 
evaluation instruments, other studies attempted to identify those 
ideal professor items associated with differing discipline areas-. 

Riley's study also probed those characteristics associated 
with instructors in different subject areas. The five most frequently 
chosen attributes for instructors in the arts,, sciences and social 
sciences he found were: 



Arts 




Knowledge of subject 




Encourages thought 


. . . 47% 


Enthusiastic 


46% 


Ability to explain clearly. . . . . 


. . . 42% 


Systematic organization 




(of subject matter) 


32% 


Sciences 




Ability to explain 


.. .89% 


Organization 




Knowledge . 


. . . 70% 


Encourages thought 


. . . 17% 


Enthusiasm 




Social Sciences 




Encourages thought 


. . . 70% 


Organization . 


. . . 48% 


Tolerance to disagreement. . . 


. . . 45% 


Knowledge 


... .42%' 


Explanation 


,...38% 
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John W. Riley, Jr. , Bryce F. Ryan, and Marcia Lifshitz, 
The Student Looks at His Teacher (New Brunswick, New Jersey: 
Rutgers University Press, 1950), p. 75. • 
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Even though Riley believed the differences identified with each 

subject area to be significant, he was as much impressed by 

"the relatively high degree of similarity in the judgments of different 

segments of the student body regarding ideal instructional traits. 

None of the variables studied reveal striking differences in the traits 

commonly judged most important. Although interesting differences 

arise, the proportion of students of different types selecting certain 

ideal traits, there is an over- all homogeneity in popular demand for 

certain attributes, depending upon the type of course given, but not 

32 

upon student characteristics insofar as they are measured here." 
Focusing on the grade differences among students. Turner's study 
of honors, average, and randomly selected students at Indiana 

University also found no remarkable difference in their choices of the 

33 

ideal professor. 

Recently Lewis' survey of upper division students at a "large 
Northeastern university," determined the images- students attributed 
to "three academic occupations (professors of science, social science, 

32 

John W. Riley, Jr. , Bryce F. Ryan, and Marcia Lifshitz, 
The Student Looks a t His Teacher (New Brunswick, New Jersey: 

Rutgers University Press, 1950), p. 77. 

• ^ 

33 

Richard L. Turner, et al . , "How Do Student Characteristics 
Affect Their Evaluations of Instruction," Indiana University School of 
Education Bulletin, 45 (July, 1969), 48-97. 
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and humanities) . . . utilizing twenty sets of polar adjectives. 1,0 
Supporting Riley, he reported that among students with dissimilar 
socialor academic backgrounds, "no consistent differences were 
found f ~ among them / when they were divided by sex, social class, 
year in college, grade average and extent of extracurricular 

OC 

participation. . ... 1,00 Lewis noted, however, "marked differences 
among those in different majors." Although the results were not 
uniform, a definite pattern of inversion emerged. For example, 
students majoring in science and humanities perceived their depart- 
mental instructors as being high in congeniality, while each group 

37 

inverted. this image for- the other department's instructors. What 
seemed significant was not that there were differences, such as Riley 
and Lewis found by subject area, but that the differences tended to be 

related to instructor behaviors, rather than student characteristics. 

• \ 

Furthermore, these subject area differences could be under- 
stood in relation to accepted concepts of role orientations. For 
within each subject area reported by Riley and Lewis, the ideal 



^Lionel S. Lewis, "Students Images of Professors, " 
Educational Forum , 32 (January, 1968), 186 - 

35lbid. , p. 188. 
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Ibid. 



37 



Ibid. , p. 189. 
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professor traits crossed the full spectrum of role orientations. Thus, 
despite a certain role dominance, such as instrumental-task ones 
in the sciences, compared with expressive- integrative ones in the 
arts, no subject area was identified with one orientation at the exclusion 
of the other. 

Although investigations of the ideal professor served a 

■ 

number of purposes, these studies, along with the ratings ones, were 
criticized for losing sight of the dynamics of the actual teaching-learning 
process. Investigators responded by noting their findings produced 
high degrees of correlation, or perhaps significant factor-loadings 
for many of the scale items used. Yet critics more, often challenged 
these results on idiosyncratic, not methodological grounds. In dis- 
avowing a questionnaire for rating teaching, a dean at Yale University 
38 

charged: 

As yet we do not know with any degree of positiveness, 
what combinations of qualities make the successful 
teacher. There are doubtless various combinations. 

Of my own best teachers in school and in college one 
man was rather irritable; one was timid and shy; he 
limped and walked with a stick; and another was sar- 
castic with an amusing streak of humor in his sarcasm; 
another disguised his seriousness with a whimsical 
manner; another was most intolerant of differing opinions. 

But we students felt they were all men. They were all . » 
competent. Two were great scholars. They were per- ' 
fectly devoted to their profession. 



**®John W. Riley, Jr. , Bryce F. < Ryan, and Marcia 
Lifshitz, The Student Looks at- His Teacher (New Brunswick, New 
Jersey: Rutgers University Press, 1930), p. 6. 
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Similarly, Gohccn reflecting upon the elusive nature of teaching, 

stated that good teaching can never be studied objectively and 

39 

scientifically because of its complexity and personal nature. 

Despite the fact that this type of idiosyncratic criticism was 

usually dismissed by investigators, these claims were bolstered by 

researchers supporting student evaluations. Following their survey 

of the ratings and ideal professor studies, Cohen and Brawer con- 

eluded "that the relationship of teacher behavior to student learning 

is not known and despite decades of research, we have not yet begun 

40 

to understand those influences." Again, Cohen indicated "that the 

whole area of teacher evaluation is beclouded with ambiguity and 

41 

bereft of determinant criteria. According to Shoben, the crux 
of this problem remained "the remarkable lack not only of any 
comprehensive theory of teaching, but also of any definitional con- 
ception of it that gives unity and meaning . . . "to student evaluations 
of teaching. He added that in spite of greater sophistication in 

3®Robert F. Goheen, "The Teacher in the University," 
American Scientist , 54 (February, 1966), 221. 

^Arthur M. Cohen and Florence B. Brawer, Measuring 
Faculty Performance (Monograph Series; Washington, D. C. : 
American Association of Junior College, 1969), p.. 36. 

^ Arthur M. Cohen and Edgar A. Quimby, "Trends in 
the Study of Junior Colleges: 1970, "Junior College Research Review , 

5 (September,- 1970), n. p. 
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statistical techniques and principles of measurement, "there has been 

42 

little investment of conceptual thought in tne teaching process itself. " 



Reformulating the Evaluations Framework 
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Reexamining the concerns of the early formulators of 

ratings research helped clarify this situation, however. In large 

measure the earlier rationale for student evaluations was closely 

associated with student expectations. This meant that instructors 

would learn the expectations of their students from the specific 

.teacher behaviors identified by the end-of-term ratings studies. 

Once learned, this would contribute to faculty effectiveness. Riley 

proposed such an evaluations framework initiated by student ex- 
43 

pectations. With few illusions about the aims and limitations of 

44 

evaluations, Riley asserted: 

It is not our intention to make any claim in support 
of the ability of any student body to make an objective 
and valid analysis of a faculty. That, it must be 
re-emphasized, is not the point at issue. The real 



^Edward J. Shoben, Jr., "Gimmicks and Concepts in 
the Assessment of Teaching, " Improving College Teaching , ed; 
Calvin B. T. Lee (Washington, D. C. : American Council on 
Education, 1967), 292. 

^John W. Riley, Jr., Bryce F. Ryan, and Marcia 
Lifshitz, The Student Looks at His Teacher (New Brunswick, New 
Jersey: Rutgers University Press, 1950). 

^Ibid. , p. 32. 
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point, rather, lies in the assumption that student 
judgments and evaluations, however immature, 
biased or prejudiced they may be, contribute to 
the complex of realities in any teaching situation. 

The professor is dealing with human beings and 
even in the classroom, where he exercizes a high 
degree of control and authority he cannot separate 
these 'beings' from their prejudices and gratifications. 

Even though later research established the reliability of students' 
rating judgment, little concern was expressed for broadening the 
end-of-course research framework. .Riley tried to counter the pre- 
vailing trend by proposing a redefinition of the student- instructor 
45 

relationship: 

Effective teaching can rely on no standardized system 
of techniques and scholarship; it must take into account 
the peculiar nature of the student. What the student 
• hears is more important than w h at the professor ‘says ; 
what the student sees is more important than what the 
professor doe s. The student's definition of the professor 
is as tangible a part of the instructional situation as are 
the skills and knowledge of the professor. Whatever 
the goal of instruction, a sound working relationship 
between student and professor is necessary for the 
fulfillment of that goal. 

* . 

While Riley contended that instructors would actually learn student 

expectations by studying the traits of the ideal professor, he emphasized 
the interaction of student and teacher, with student expectation inputs 
playing a continuing role in the dynamics of the classroom. Supporting 



^John W. Riley, Jr., Bryce F. Ryan, and Marcia Lifshitz, 
The Student .Looks at His Teacher (Mew Brunswick, New Jersey: 
Rutgers University Press, 1950), p. 33. 
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this, Riley advanced that: 

An integral complement to the rating scale itself has 
been the student's expression of the instructional 
ideals. Without such a yardstick the full significance 
of the rating on any attribute is lost, for we would 
not know whether or not the attribute itself held high 
value for the student. This point takes on added 
meaning to the individual instructor who studies his 
own evaluation when we recall that some attributes 
are undoubtedly 'competitive' with others. Thus if 
organization is wittingly sacrified for the sake of 
stimulating ins i ghts generated in the classroom, 
the professor may find it useful to know the sig- 
nificance attached to each of these partially competing 
attributes. (Italics mine. ) 

Finally, Riley recognized the explicit consensual relationship between 

47 

expectations and evaluation: 

If the student is to be given some systematic opportunity 
. to evaluate his professor, he must at the same time be 
allowed to express his ideal expectations of the professional 
function. For one without the other becomes only a half 
statement and may be quite meaningless . There is, for 
example, one conclusion to be drawn from a student who 
is critical of a professor who has failed to give what 
the student most wanted, i. e., stimulation to individual 
thinking; but quite a different conclusion is demanded if 
the student expressed little regard for this pedagogical 
result but rather pinned his hopes on 'learning the subject 
.matter.' (Italics mine. ) 

Riley's contributions provided an answer to the initial question posed 



46 John W. Riley,. Jr. , Bryce F. Ryan, ’and Marcia Lifshitz, 
The Student Looks at His Te a cher (New Brunswick, New Jersey:. 
Rutgers University Press, 1950), p. 57. 




•Ibid., p. 61. 



in this chapter: what did evaluations measure? Certainly they 
measured student role expectations of instructors. It was 
therefore timely to develop an "evaluations perspective that went 
beyond end- of- course instructor ratings. A more inclusive and 
explicit evaluative framework for college instruction proved 
necessary, a framework that included student expectations. 
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CHAPTER II 



A FRAMEWORK FOR STUDENT ROLE EXPECTATIONS 

OF INSTRUCTORS 



Despite Riley's proposals, few evaluations researchers ex- 
amined the problem of how students entering a particular course orient 
themselves to the instructor. Moreover, few studies investigated 
student definitions of their role-oriented tasks, especially in association 
with the classroom, curriculum and faculty; in a word, an "evaluations 
ambience."* Yet no investigator challenged the assertion, that "the 
students' orientation, their expectations and role definitions are likely 

to provide considerable insight into how the students will adapt to 

2 

the demands made upon them. " 

Even with the increased investigations related to student 
expectations of the overall college environment, the absence of 



*W. J. McKeachie, "Research on Teaching at the College and 
University Level," Handbook of Resear ch on Teaching , ed. N. L. Gage 
(Chicago: Rand McNally and Company, 1963), 1118-1172. jCf. D. Bob 
Gowin and Donald E. Payne, "Evaluating Instruction: Cross Perceptions 
of College Students and Teachers, " School Review , 70 (Spring, 1962), 
217-219. 

2 

F. M. Katz and Cecily N. Katz,* "Great Expectations, u 
Universities Quarterly, 23 (Autumn, 1969), 420. 
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research concerned with student expectations of instruction persists. 

Researchers seemed to take for granted that students' total expectations 

of courses and instructors contributed to the successes and failures in 

* 3 

their college experience, without apparent evidence. Although such 

college environment studies suggested "that people work more effectively 

4 

•in situations that conform to their preferences, " , there still remained 

inadequate evidence explaining expectations held for specific courses 

and instruction. College environment studies indicated, nevertheless, 

that if a ‘student's expectations’ were realistic, the process of adaption 

to the college environment became relatively smooth, and was likely 

5 

to be- reflected in successful fulfillment. This point served addition- 
ally as a reason to examine expectations related to courses and 
instructors. Since this study assumed, however, that the student 
dimension related to classroom expectations was a broad one, it con- 
cerned itself only with student role expectations of college teachers in 
relation to student evaluations, in a particular course of study. 



®F. M. Katz and Cecily N. Katz, "Great Expectations, " 
Universities Quarterly, 23 (Autumn, 1969), 420. See also Lawrence A. 
Pervin, "Reality and Non- reality in Student Expectations of College," 
Journal of Psycholog y, 64 (September, 1966), 41-48. 

4 lbid. , p. 420. 

^Kenneth A. Feldman and Theodore M. Newcomb, The Impact 
of College on Students (San Francisco: Jossey-Bass, Inc., 1969), I, 81-82. * 
See IC. A. Feldman, "Studying the Impact of Colleges on Students," 
Sociology of Education, .42 (Summer, 1969), 207-237. 
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To be sure, the present study could only be a first approx- 
imation of student expectations of specific courses and instructors. 
Accordingly, it examined the relationship between student expectations in 
certain classes at the start. of a semester compared with their subse- 
quent ratings toward the end of that same semester. This procedure 

6 

contrasted with the typical research design of student evaluations. 
Invariably these investigations sought student responses on instruments 
designed to rate the instructor upon--or near--the completion of a course. 
From the collected data, researchers drew their conclusions concerning 
the influence of intervening variables on their evaluations. Operationally, 
this meant that a student’s initial expectations were to be inferred from 
an end-of-term rating. From the point of view developed in this paper, 
end- of- course ratings should be examined as an integral part of an 
evaluative process that includes measuring initial student expectations. 

The Classroom as a Consensual Experience 

Newcombls formulation of "consensual expectations" was most 

7 

useful in making this evaluative process more inclusive. Although 

* 

®H. H. Remmers, "On Students' Perceptions of Teachlrs' 
Effectiveness," The Appraisal of Teaching In Large Universities , ed. 

W. J. McKeachie (Ann Arbor: University of Michigan Press-, 1959), 

17-20. 

7 Theodore M. Newcomb, "Student Peer-Group Influence," 

The American College , ed. Nevitt Sanford (New York: John Wiley 
and Sons, 1962), 469-488. 
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Newcomb concerned himself with shared peer group influences in re- 
lation to learning outside of the classroom, his formulation served as 
a bridge connecting student expectations and teacher objectives. Borrow- 
ing this model of shared relationship for the teaching-learning framework 
proved to be a working model for classroom research. (If developed 
• as a research model, it might become an integral factor in the process 
of student- college adaptation and outcomes.)- In comparison with this 
open-ended framework, Rudolph expressed deep concern for the tra- 
ditional exclusion of students from the teaching-learning process both 
in practice and research: "And what is most distressing of all is how 
often in our history students have had to tell us of their presence- -of 
their needs as young human beings discovering the limits of their 
individual destinies. " It seemed appropriate, therefore, to develop 
an evaluations research model focusing primarily on student inputs. 

Unlike research models characterizing student inputs as achievement 
or performance levels, intelligence quotients, or socio-economic status, 
the consensual expectations idea suggested that initial expectations 

added to a more inclusive understanding of„the student's domain, es- 

% 

pecially in relation to their evaluations of college courses. 

One problem faced by employing the consensual expectations 

^Frederick Rudolph, "Neglect of Students as a Historical 
Tradition, " The College and thc Students , eds.. L. E. Dennis and J. F. 
Kauffman (Washington, D. C. : American Council on Education, 1966), 

58. ' 
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approach as an evaluations model was the limited empirical findings 
on shared student- instructor interaction in the classroom. Surely no 
researcher--or practitioner- -overtly denied the centrality of this 
relationship, actual studies of student expectations of college had 
usually considered non- instructional characteristics, such as dormi- 
tory regulations or social and extracurricular activities, treating the 
academic variables in more panoramic tferms. Even the widely used 
College Characteristics Index developed by Stern and Pace contained 
faculty- related items of so general a nature as to be of little value 
in discussing specific behaviors related to instruction. The Index 
used such items as: "Some of the professors treat questions in class 
as if the students were criticizing them personally," or "Many faculty 
members seem moody and hard to figure out. These items certainly 
related to the perception of an overall environment, but added little 
understanding of specific student expectations of instructors. Used to 
demonstrate increasing student disaffection and alienation from college, ^ 
these studies did not measure whether failure to meet student instruc- 
tional expectations was connected to instructors and courses as such, 

and not with generalized institutional dynamics. Instead, consensual 

. — • • 

^George C. Stern,- People in Context : Measuring Person--Environ 
ment Congruence in Education and Industry (New York: John Wiley and 
Sons, 1970), p. 343. 

^George C. Stern, "Myth and Reality in the American College, " 
AAUP Bulletin , 52 (December, 1966), 403-414. 
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expectations suggested that student instructional evaluations could be 
more appropriately understood when treated as a function of their 
expectations. 

Correspondingly, the research aim of consensual expectations 
suggested expanding current notions of student evaluations of instruc- 
tors. ** Here it followed recent models associated with innovations 

in undergraduate education, specifically those models emphasizing 

12 

teacher-learner interaction. Usually organizational theory, most 
« • 

often the process or systems models, supplied the conceptual frame- 

13 

work for various innovative strategies. By contrast, previous 
•evaluations studies presumed the equilibrium model of classroom 
organization. This unidimensional approach operated with the least 
possible disturbance: the instructor taught and the students learned. 
Consequently, end-of- course ratings- were not conceived as reciprocal 
or mutually derived task fulfillments, but measured teacher behavior 



* * Joseph Axelrod, Model Building for Undergraduate Colleges , 
U. S. Office of Education Contract No.. OEC 6-10-106 (Washington, D. C. 
Educational Resources Information Center, 1969), pp. 1-32. 

12 

Douglas H. Heath, Growing Up in College: Liberal 
Education and Maturity (San Francisco: Jossey-Bass; Inc., 1968), 
pp. 20-48. 

1 O 

Joseph Axelrod, Mo del Building for Undergraduate Colleges , 
U. S. Office of Education Contract No. OEC 6-10-106 (Washington, 

D. C.: Educational Resource Information Center, 1969), pp. 1-32 and 
passim. 
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exclusively. The process model or complex,- adaptive system, thriving 

on "disturbances and variety" in the environment, offered an evolving 

framework with potential for high order student and teacher inter- 
14 

change. If the process model contributed this concept of interchange 
to consensual expectations, the systems, model proposed the concept of 
feedback. Essentially this meant that student and teacher monitor and 
reassess each other's expectations and objectives regularly. Taken 
together, the process and systems models permitted student involve- 
ment in the articulation of instructional objectives, in addition to 
producing instructor awareness of student dispositions, with some 
provision for feedback. Typical end-of-term ratings instruments might 
be included, too. But not without student and teacher becoming mutually 
responsible for the fulfillment and assessment of each other's stated 
and mediated expectations and objectives. In these t^rrns, then, the 
structure of consensual expectations would be determined largely by 
interchange and feedback. The evidence demonstrated that this task 
should be realizable for both large and small institutions. For, as 

Eddy's findings in 1959 indicated, which Feldman and Newcomb re- 

* 

stated in 1969: "The extent to which the student comes into direct 

. * 

contact with the professor depends more upon the attitudes and efforts 

^Walter Buckley, Sociology and Modern Systems Theory 
(Englewood Cliffs, New Jersey: Prentice-Hall, Inc., 1967), p. 40. 
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of the institution than upon its size. 

Understandably, there would be some lag between the growth of 
these role- sharing typologies in research and their acceptance in 
practice. In fact, in anticipating possible colleague criticism, in- 
vestigators of student evaluations often assumed a defensive research 
1 6 

posture. In 1961, and again in 1967, Gustad remarked that faculty 

criticism persisted even after end-of-term ratings appeared on more 

17 

college campuses. In their recent study Cohen and Brawer reported 

that "a somewhat cynical opinion among some teachers that very little 

18 

value can be placed on student ..." judgment still prevailed. 

Proponents of student ratings contended. that faculty criticism 
stemmed largely from the belief that students lacked maturity to 
formulate sound evaluative judgments. Two major criticisms of 



1 5 

Kenneth A. Feldman and Theodore M. Newcomb, The 
Impact of College on Students (San Francisco: Jossey-Bass, Inc., 

1969), 1. 580. 

16 

Charles E. Gray, "The Teaching Model and Evaluation of 
Teaching Performance, " The Journal of Higher Education , 40 (November, 
1969), 636-642. 

17 

John W. Gustad, "Evaluation of Teaching Performance: 
Issues and Possibilities, 11 Improving College Teaching , ed. Calvin B s '< 

T. Lee (Washington, D. C. : American Council on Education, 1967), 
276-277. 

1 O 

Arthur M. Cohen and Florence B. Brawer, Measuring 
Faculty Performance (Monograph Series; Washington, D. C. : 

American Association of Junior Colleges, 1969), p. 11. 
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of student evaluations summarized by McKeachie were: 

1. Students really can't evaluate a teacher until 
they've left college and gotten some perspective 
on what was really valuable to them. • 

2. Students rate teachers on their personality-- 
not on how much they've learned. 

Though critical, Megaw offered an explanation for faculty resistance: 

"Is it not unaccountable that so many people so openly committed to 
enlightenment should for so many years put up with such ignorance 
about a procedure j_ evaluation^/ so close to the heart of their common 
endeavor?" 29 He disclosed two "main faculty motives . . . fear and 
laziness," explaining "the nightmarish fear of being declared in- 
competent, or at least shamefully inexpert . . . . " Regarding 
laziness as less intense than fear, but "more endemic, " he continued, 
"not a general, undifferentiated laziness, however--most teachers 
put in a long working day--but a special laziness of the experimental 
spirit: reluctance, in short, to consider new patterns of overwork. " 

In defending "the honorable faculty motive for resisting" evaluation, 

he added, "/""their”? loyalty to that which goes on in the classroom is 

• . 

vastly more complex than any definition of it, and that its chief 

19 W. J. McKeachie, "Student Ratings of Faculty, " AAjJP 
Bulletin , 55 (December, 1969), 439-444. 

. 20 Neill Megaw, "The Dynamics of Evaluation," Improving 
College Teaching , ed. Calvin B. T. Lee (Washington, D. C.: American 
Council on Education, 1967), 282. 
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values reside in this complexity." If Megaw's explanation, as noted 
by others, ^ that currently understood end-of-term ratings erode 
classroom complexity, they deserved the criticisms received. 

By contrast, the consensual expectations approach would 
require measurement of instructor evaluations as a function of initial 
student expectation inputs. If the research being proposed by con- 
sensual expectations demonstrated that students were reliable classroom 
participants, then certain faculty objections to this evaluative process 
might be dispelled. Furthermore, consensual expectations would 
certainly add, not detract, from the classroom complexity cited by 
Megaw as a faculty objection to evaluations. 

Consensual Expectations and Group Dynamics Theory 

The theories of group dynamics ‘provided the conceptual frame- 
work for this research on student expectations. Clearly, if any theory 
were to make a contribution, it must relate expectations to the role 
relationships in the college classroom. Despite its concern with an 
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Neill Megaw, "The Dynamics of Evaluation, " Improving 

College Teaching, eci. Calvin B. T. Lee (Washington, D. C. : American 
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Kenneth E. Eble,. The Recognition and Evaluation of 
• Teach ing, Report of the Project to' Improve College Teaching, Salt 
Lake City, Utah, November, 1970 (Salt Lake Cityi Utah: Project 
to Improve College Teaching, 1970), pp. 1 8-’l 9. . 
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industrial setting, Stogdill's work on group achievement provided the 

93 

appropriate framework. Like Newcomb's consensual expectations 
framework, Stogdill's work has been identified in the literature of the 
field as systems and open-ended theory. For Stogdill group per- 
formance and achievement were determined by initial member inputs, 
that is, expectations. Hence, his expectations formulation served 
as a working model for the determinants of student expectations. 



Influenced by earlier sociological theorists such- as Mead, 

Stogdill's definition of expectation could be traced to the work of the 

26 

learning theorists, Tolman and Kelly. Stogdill defined expectation 

as "a readiness for reinforcement" which "is a function of drive, 

the estimated probability of occurrence of possible outcome, and the 

97 

estimated desirability of the outcome. "“ By drive he meant "the 
level of tension reactivity exhibited by an organism, ” which would 
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Ralph M. Stogdill, Individual Behavior and Group 
Achievement (New York: Oxford University Press, 1959). 
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Dorwin Cartwright and Alvin Zander,. "Introduction, " Group 
Dynamics: Research and Theory , eds. Dorwin Cartwright and Alvin 
Zander (3d ed. ; New York: Harper and Row, Publishers, 1963), 26. 
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become activated into readiness for reiaforcement defined as "the 

extent to which an individual is prepared or unprepared to experience, 

28 

or reconciled to the prospect of experiencing, a possible outcome. " 
Actual outcomes would be mediated by an individual's estimates of 
occurrence which "refers to the individual's prediction, judgment, or 
guess relative to the likelihood that a given event will occur, "as well 
as by the estimated desirability which he defined as "an individual's 
judgment relative to the satisfyingness of, need for, demand for appro- 
priateness of, or unpleasantness of, a possible outcome." 3 In the 
classroom these two estimates determined student, expectations 



associated with projected outcomes of teacher behaviors. Aware that 
some question might be raised regarding the positive or negative 
valence of the estimates, Stogdill added that the "estimate of prob- 
ability and estimates of desirability are not opposite ends of the same 
continuum .... " 30 Instead, these estimates "interact to determine 
the level of expectation." 3 * And most relevant to the classroom, "this 
interaction is formulated in terms of what is uncertain in the future as 
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Ralph M. Stogdill, Individual Behavior and Group ' 
Achievement (New York: Oxford University Press, 1959), p. 62. 
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well as in relation to what has been learned in the past. " 

If this concept of expectations was understood as an inter- 
action of probable and desired estimates, the particular behavior 
outcomes were dependent upon the classroom role structure. Accord- 
ingly, Stogdill first distinguished between the place of the formal 
structure of groups, making it possible to determine the different 

j 

j • • 

expectations associated with individual instructors. Stogdill then 
discriminated between expectations "attached to position rather than 



the occupant of a position. " Since the classroom's formal structure 
prescribed the professor's status and function which "are defined for 



Stogdill emphasized the relevance of expectations related to role structure. 
Role structure, in short, was the more inclusive concept. By distinguish- 
ing between formal and role structure, Stogdill's approach contributed 

/ 

additionally to evaluations research. For if the ideal professor studies 



identified classes of behavior states associated with the formal in- 
structional expectations, then the study of their interplay in actual class- 
room settings was one to which this study addressed itself. 



Furthermore, in keeping with previously developed organization - 



theory, Stogdill introduced the concept of responsibility and authority 



33 

a position rather than for any given occupant of the position," 




33 Ralph M. Stogdill, Individual Behavior and Group Achievement 
(New York: Oxford University Press, 1959), p. 128. 

33 Ibid. , p. 123. 
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to differentiate two levels of role structure expectations. Responsi- 
bility, he defined as the set of performances that a given occupant was 
expected to exhibit "by virtue of the operational demands made upon his 
position," contrasted with authority representing the degree of freedom 

that the occupant of a role-could be expected to exercise as an in- 
34 \ 

dividual. In effect, both the probable and desirable estimate of 

behaviors defined by the classroom's role structure would produce "a 

set of expectations " which "imply, not a psychic entity of any sort but a 

..35 

mutually confirmed readiness for reinforcement .... 

Student expectations as proposed by Stogdill were used in this 
paper as a role- related set of probable and desirable estimates of in- 
structor behaviors for which students exhibited a readiness for 
reinforcement. This formulation permitted classroom role expectations- - 
and subsequent evaluations- -to be understood as both a function of the 
normative aspects of a classroom coupled with individual instructor 
characteristics. The interaction of student- estimates of desirable and 
probable instructional behavior outcomes/ in other words, not only 
determined evaluations, as the ideal professor studies suggested, but 
also activated student expectations. Thus, the concept of expectations 
employed here included the readiness for projected behavior 
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34 Ralph M. Stogdiil, Individual Behavior and Group Achievement 
(New York: Oxford University Press, 1959), • p. 129. 



35 



Ibid., p. 128. 



I- 

fir 



"O 

ERIC 



* 



41 



outcomes. . 

% 

It was obvious that the model adopted by the end- of- course 
ratings studies limited the understanding of expectations in relation 
to evaluations. By excluding estimates of probability and desirability, 
these studies failed to recognize that student expectations precede 
their evaluations. According to the definition of expectations employed 
here, an affirmative or negative rating in an instructional behavior 
category could have exceptional meaning if initial expectation inputs 
were taken into account. In the ratings studies, however, there seemed to 
be no way to ascertain whether students expected a teacher to be friendly, 
even though he in fact turned out to be quite friendly. The students 
then rated him affirmatively. The student or class could be, nonetheless, 
expecting to achieve more organized knowledge about the subject than 
the instructor gave for which students rated him negatively. In the 
ratings studies the tendency would be to equate the two results, perhaps 
one equalizing or cancelling the other. Extrapolations of student ex- 
pectations made from their evaluations could be, in sum, of limited value. 

Building upon this perspective, Stogdill's formulation as 
open-ended group theory contributed to an understanding of the consensual . 
expectations point of view. Here Stogdill's relationship with evaluations 

^Joseph Berger, Thomas L. Conner, and William L. McKeown,- 
"Evaluation and the Formation and Maintenance of Performance Ex- 
pectations, " Hurrmp_Rel^tions, 22 (December, 1969), 481-502. 
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should be noted. For if the systems-process models suggested that 
evaluations are a function of interchange and feedback based on initial 
inputs, then Stogdill's notion of role structure offered an explanation 
of the organization of expectation inputs. Analogously, if the systems 
approach explained the structural and interactive dimensions of con- 
’ sensual expectations, then Stogdill defined the situation that determined 
expectations. 

A recent study by Phillips developed procedures resembling 

37 

those suggested by consensual expectations approach. As such, it 

represented the only one attacking the problem of student evaluation as 

a function of desired instructional outcomes compared with actual ratings 

38 

Phillips noted that: 

The outcomes of teaching would seem to be complexly 
determined by at least four factors: the characteristics 
of the teacher, the students, the subject matter, and 
the class as a group. And, what is more important, 
it appears to be the interaction of these factors which 
partly produce, differences in outcomes in teaching. 

Adding that since "the importance of a particular criterion of teaching 

effectiveness is likely to vary from student to student, and from class 

* 

to class, " Phillips maintained the end-of-term gatings studies, "may 



07 

Beeman N. Phillips, "The 'Individual* and the 'Classroom 
Group' as Frames of Reference in Determining Teacher Effectiveness," 
Journal of Educational Research, 58 (November, 1964),. 128-131. 



S^Ibid. , p. 128. 



often provide an overgeneralized estimate of a teacher’s effectiveness. M 
To change the direction of this research, he asked students in eight 
sections of an educational psychology course toward the end of the term 
to complete two instruments, one to determine "what happened in this 
course," and the other to determine which experiences students "considered 
to be most important in the courses generally." For purposes of 
analysis, Phillips clustered the instrument items according to studies done 
at the University of Michigan in which four classroom factors were de- 
rived using factor analytic techniques with a relatively large sample of 
41 

students. These factors were: amount of structure provided; amount 
•of information given about tests; amount of achievement motivation 
aroused; and, amount of personal warmth in class. By clustering items 
in this way, Phillips noted that a more realistic picture of a student’s 
expectations would be achieved, as opposed to a more generalized picture 
usually given by an item by item analysis. He reported significant re- 
sults at the .01 level in all four categories: students expected more 
structure and achievement motivation arousal which their experience did 
not reinforce, and students expected less information about tests and 

^Beeman N. Phillips, "The ’Individual’ and the ’Classroom 
Group’ as Frames of Reference in Determining Teacher Effectiveness," 
Journal of Educational Research, 58 (November, 1964), p. 128. 
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personal warmth in class which was overly reinforced by the instructors. 
Unfortunately, his design was not longitudinal because the two tests 
were taken successivelyxand limited because he asked students to use 
an ideal course framework for the expectations instrument instead of, 
the actual course. Phillips, nonetheless, pointed ip the direction of 
'further explanation in expectation- evaluation studies. In anticipating the 
consensual expectations model, Phillips saw the need for additional re- 
search of a longitudinal type. 

* * 

Phillips' study dealt with the interactive qualities of expectations ' 

•and evaluations, another one by Twa concerned itself with role conditions 

43 

that determine expectations. It examined the expectations of community 

college- students in relation to different instructional roles, namely, 

transfer instructors, occupational instructors, and adult education instructors. 

Twa contributed a persuasive argument for utilizing the consensual ex- 

44 

•pectations framework: 

42 

Beeman N. Phillips, "The 'Individual' and the 'Classroom 
Group' as Frames of Reference in Determining Teacher Effectiveness, " 
Journal of Educational Research , 58 (November, 1964), p. 131. 

43 

R. James Twa, Student and Instructor Expectations of 
Community College Instructors (Eugene, Oregon: Oregon School Study 
Council, 1970). 

44 - 

Ibid. , p. 28. 
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Persons have expectations of others' with whom they 
must interact to attain their goals. Conformity to 
these expectations normally result in rewards, and 
non-conformity normally results in negative sanctions 
being applied to the person. However, in the student- 
instructor situation a complication arises because the 
students may jeopardize their chances of goal attainment 
if they (as subordinates) apply negative sanctions to 
instructors (superiors). Thus,' because of their in- 
ability to take action, the student's frustration is 
heightened. 

% 

Twa concluded that the teaching-learning relationship is a consensual 

one: "Each is dependent upon the other to fulfill successfully the 

objectives for his respective mission. " 4 ^ The findings of his study 

disclosed that students place great importance on the personal relation- 

46 

ships they expect of instructors. 

Relating Classroom and College Expectations 



Despite its concern with institutional and environmental 

variables. Stern's work added another dimension to the concept of ex- 
47 

pectations. Stern developed a College Characteristics Index to 
determine student expectations of .the demands or pressures of a college's 



^R. James Twa, Student and Instructor Expectations of 
Community College Instructors (Eugene, Oregon: Oregon School 
Study Council, 1970), p. 5. • # 

46 Ibid. , p. 26. 

• # 

47 George C. Stern, People in Context: Measuring Person- - 
Environment Congrue nce in -Education and Industry (New York: John 
Wiley and Sons, 1970). 
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46 



. 48 

environment, called press. The Index also permitted Stern to determine 

49 

aspects of individual personality, called needs. Accordingly, the 
needs scale attempted to identify "the goals or purposes that an inter- 
action serves for the individual. In this sense a listing of needs is 
essentially a taxonomy of the objectives, that individuals characteristically 
strive to achieve for themselves."^ The concept of environment press, 
as a corollary of needs, was defined by Stern as "the phenomenological 
world of the individual, the unique and inevitably private view each 
person has of the events in which he takes part." In order to differ- 
entiate between an individual's press expectations, and those presumably 
posited in the institution. Stern made a distinction between "explicit 
objectives , representing the stated purposes for which given institutional 

events are organized, and the implicit objectives , which are in fact 

/ 

52 

served by institutional events."- Thus the taxonomy of- a student's 
expectations, which the Index measured for given institutional elements. 



^George C. Stern, People in Context: Measuring Person- 
Environment Congruence in Education and Industry (New York: John 
Wiley and Sons, 1970), p. 7. 



SOlbid., p. 6. 
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such as faculty, coulcl be different from a "detached observer." 

That is, a student’s expectations of "the situational climate, the per- 

missible roles and relationships, the sanctions and so on, "could differ 

54 

from the detached observer, presumably the investigator. To 
apply this typology in practice. Stern suggested that the congruence or 
dissonance of student expectations could be compared with the reality 
of an institution’s matrix.' 

Regarding the curricular and academic expectations of college 
freshmen, some additional questions the Index raised related to faculty 
should be noted: were students treated formally or informally by faculty? 
Were faculty demands upon students heavy or light? Did the general 
teaching procedure emphasize lectures or free discussions? Stern’s 
use of the Index on a large sample of freshmen at Beloit College, 

Cazenovia College, St. Louis University, and Syracuse University, seemed 

55 m 

most relevant here. He advanced the proposition that a Freshman 

Myth" existed in American colleges, for they "share stereotyped ex- 
pectations of college life that combine some of the elite liberal arts 
college with the community spirit, efficiency and social orderliness of 
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56 

church- related schools." More emphatically, he declared: 

"University-bound high school seniors evidently share a highly idealized 

57 

image of college life representative of no actual institution at all." 

Moreover, despite the fact that the four colleges in his sample were 

quite different, "the expectations of the four groups of freshmen follow 

58 

a substantially similar pattern." These freshmen "look forward'to 
high levels of activities relevant to both academic and nonacademic 
press . . . . " Yet, "this does not correspond to the actual character- 
istics of these schools at all. It is evident that the incoming freshman 

.expected something rather different from his upper division colleagues . . 

* 

As an entering freshman, he came expecting to learn, as a senior he 

60 

has learned not to expect so much. " 

In order to consider these findings a distinction had to be 
made between the two differing dimensions of expectations being treated, 

. The expectations dimension examined by Stern, associated with the 



^George C. Stern, People in Context: Measuring Person — 
Environment Congruence in Education and Industry (New York: John 
Wiley and Sons, 1970), p. 173. 
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over- all press of an institution, should be called situational expectations. 
By comparison, the dimension of expectations put forward by Stogdill 
and represented by consensual expectations, which were linked with 
group role structures, should be called contextual expectations. Even 
if Stern differentiated between explicit and implicit group objectives, 

I 

resembling Stogdill's formal and role structure expectations, these 
objectives went beyond reference to specific classroom role contexts. 

. This distinction made a comparison between the two concepts 
possible: did freshmen contextual expectations differ significantly from 
other classmen? In turn, were the subsequent freshmen evaluations 
significantly different from their expectations lending support to the 
iricipent alienation thesis Stern projected as part of a general fresh- 
men disenchantment? Moreover, did the freshmen exhibit the remark- 
ably similar contextual expectations of their instructors as Stern's 
sample exhibited for the over- all situatiortal ones? Answers to these 
questions presented by this study could invite further research con- 
cerning contextual and situational expectations in varying aspects of 
college culture. 
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CHAPTER III 



DESIGN, RESEARCH PROCEDURES AND HYPOTHESES 



As a study of student contextual expectations of college in- 
structors, the design was determined by the natural setting of the class- 

i 1 

room. For college classroom research Gage distinguished three types 

of appropriate study designs. 1 First, descriptive studies depicting 

aspects of the teachingrlearning process. Second, experimental research 

manipulating certain aspects of the classroom situation--the independent 

variable--in seeking measurable changes in students. And conversely, 

experimental research relating changes in student--the dependent variable 

2 

--to some measure of the teacher’s behavior or classroom experience. 

In adopting the latter method, this study treated expectations as- the 
independent variable, and evaluations and certain other variables to be 
noted later,- as the dependent ones. 

Study Design and Procedures w 

The study also used a one- semester longitudinal design. At 

*N. L. Gage, M The Appraisal of College Teaching: An 
Analysis of End and Means, ”• Journal of Higher Education, 32 (January, 
1961), 17-22 

' i 

2 lbid. , p. 21 • ' 
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the beginning of a semester a sample of college students reported their 

expectations, and after an interval, their evaluations were determined. 

This method avoided the possible instrument contamination noted re- 

3 

garding Phillips 1 study of testing and posttesting on the same day. 
Furthermore, since the present study treated evaluations as a function 
of expectations, the posttest could have been given at any point during 
the semester. 

Arrangements were made in the Spring of 1970 in three 
colleges and with eight instructors in whose introductory social science . 
classes this study was conducted. Selection of the instruccors was 
based on two considerations. One, the subject matter in each course 
was relatively equivalent, and, two, the instructors employed discussion 

1 y- 

methods£predominantly. The author determined this point by a prior 
classroom visit and discussion with the participating instructors. More- 
over, an introductory course was chosen because it enrolled a high 
percentage of freshmen, making any finding regarding their contextual 
expectations more representative. Divulging only that the study aimed 
to learn more about students taking social science courses, the author 
requested the instructors not to discuss the study or the instruments, 
which remained unseen, with either students or colleagues. Havirfg 
also agreed not to reveal to their students that they would participate 

^Beeman N. Phillips, "The ’Individual 1 and the ’Classroom 
Group* as Frames of Reference in Determining Teacher Effectiveness, “ 
Journal of Educational Research, 58 (November, 1964), 128. 



in this study, each instructor introduced the author on the second day 
of class, and then left the room; the expectations instrument was then 
administered. Students were not told that the instrument would be 
retaken with modifications for evaluations purposes following ten weeks 
of classes. Insofar as can be determined, the instructors cooperated 
fully by not discussing the study with students. In two classes with 
absentees on the posttest, the. instructors asked the author to return 
on the next class day so the students could complete the schedule. 

This longitudinal design was coupled with an interinstitutional 
sample. Although most evaluations research was based on single- 
institutions, Astin recently advanced that the multi- institutional approach 

4 

acted as a modified control, and should therefore be encouraged. The 
three colleges were chosen primarily because of their differences in 
character and clientele. Chosen because they shared the explicit in- 
stitutional goal of stressing the teaching function, they were all, to be 
sure, located conveniently within a fifty- mile radius.. Alpha College, 
a seventy-year old two-year women’s residential college with a somewhat 
selective enrollment close to 400, offering both liberal arts transfer and 
terminal programs in the fine, applied and performing arts, and early 
childhood education, had four of its classes’ surveyed. Abo,ptrv?5 students’ 
were in each class. Beta College, by contrast, a comprehensive 
community college about a dozen years old with a full-time enrollment of 

^Alexander W. Astin, "The Methodology of Research on College 
Impact, M Part I, Sociology of Education , 43 (Summer, 1970), 251 



53 



/ 

{ 



2, 500, having an open- admissions policy had two classes surveyed. 

Both had about 25 students. In comparison with these colleges. Gamma 
College, a one- hundred year old four-year women’s residential college, 
recently turned to co- educational, enrolling 200 males out of 1, 600 
students, with some strong liberal arts programs, and a' highly selective 
admissions policy, had two classes surveyed. One enrolled 25 students, 
the other 40. Owing to the full cooperation of the instructors, all 
students completed both the expectations and evaluations instruments, 
resulting in a total student population of 209. 

*\ . 

Developing an Instrument 



The instrument employed in the study developed principally 
from two different analytical techniques. These techniques were used 
to transform the categories developed in ratings and ideal professor 
studies into specific instrument items. Perhaps the most widely selected 
items were found in the University of Michigan studies which applied 
factor- analytic techniques to items gathered from previous study instru- 
ments. 5 From a pool of 145 items, the Michigan studies derived 34 

6 

items through rotated factor analysis. It may be recalled that Phillips 

, * 

^Robert L. Isaacson, Wilbert J. McKeachie, and John E. 
Milholland, "Correlation of Teacher Personality Variables and Student 
Ratings, " Journal of Educational Psychology , 54 (April, 1963), 110-117, ' 

and R, L. Isaacson, etal., "Dimensions ofStudcnt Evaluations of Teaching," 
Journal of Educational Psychology , 55 (December, 1964), 344-351. 

®R. L. Isaacson, et^al. , .op. cit . , p. 345. 



utilized four instructional categories which he derived from these 

7 

34 items. 

The other attempt to distinguish as well as to cluster items 

within instructional categories employed the Critical Incident technique .. 

8 

developed in a University of Houston study. Beeause its approach 

actually identified items related to contextual expectations of college 

instructors, rather than evaluations per se as the Michigan study did, 

its usefulness to the present study was obvious. 

« 

Based on previous work by Flanagan, Owen defined the 

Critical Incident technique as "a systematic method of observing and 

9 

analyzing human behavior.' 1 Essentially phenomenological in approach, 

Flanagan defined an incident as "any observable human activity that is. 

sufficiently complete in itself to permit inferences and predictions to 

„10 

be made about the person performing the act. To account for situ- 
ational exceptions, Flanagan added: "To be critical, an incident must 
occur in a situation where the purpose or intent of the act seems fairly 
clear to the observer and where its consequences are sufficiently definite 

7 

Beeman N. Phillips, "The ’Individual 1 and the ’Classroom 
Group' as Frames of Reference in Determining Teacher Effectiveness," 
Journal of Educational Research, 58 (November, 1964), 128-131., 

8 

Paul H. Owen, "Some Dimensions of College Teaching: An 
Exploratory Study Using Critical Incidents and Factor Analysis of Student 
Ratings" (unpublished Doctoral dissertation. University of Houston, 1967). 

9 Ibid. , p. 19. ‘ • 

*®Owen, op. cit. , p. 21. 
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to leave little doubt concerning its effect. Used extensively in studies 

of elementary and secondary school teachers# notably in Ryans’ nation- 

wide survey on teacher characteristics, Owen employed structured 

and unstructured procedures to obtain the initial inventory of critical 

t> • • incidents that students associated with college instructors’ behaviors. 

From the resulting list of behaviors, Owen developed an instrument, 

which he subsequently administered along with the Michigan instrument. 

His findings demonstrated a high degree of intercorrelations between 

the items obtained through the critical incident and Michigan factor 

analysis techniques. Likewise, the six categories he turned into clusters 

from the Critical Incident techniques were "substantially verified ..." 

1 3 

with the Michigan categories. 

Fortunately, these findings offered the first empirical evidence 
that students employed similar behavioral frameworks for their ex- 
| • pectations and evaluations of instructors,. Consequently, the current 

. study clustered six categories identified by the Houston Critical Incident 

method for purposes of analysis, in addition to selecting instrument 
items from it and the Michigan factor- analysis technique studies. To 



**Paul H. Owen, "Some Dimensions of College Teaching: An 
Exploratory Study Using Critical Incidents and Factor Analysis of Student 
Ratings" (unpublished Doctoral dissertation. University of Houston, 1967), 
p. 21. 



• 12 

David G. Ryans, Characteristics of Teachers (Washington, 
D. C.: American Council on Education, 1960), pp. 79-83. 




clarify. the major classes of instructor behaviors identified through the 
Critical Incident technique, a description of the six categories follows: 

I. Presentation of Material- Content, Structure and Scope 

Behaviors related primarily to the structuring of the content. 
These include the organization, planning, selection and 
preparation of content; use of supplementary references and 
illustrations; use of practical examples including personal 
experiences; thoroughness of explanations and level of 
difficulty of presentation; apparent knowledge of subject. 

II. Presentation of Material-Student Participation 

Behaviors related primarily to student involvement in 
presentation of material. These include instructors' 
relative emphasis on lecture and/or -class participation, 
student involvement in organizing and presenting material 
and assignments to students which specifically relate to the 
presentation of material. 

# i. 

III. Presentation of Material-Instructor's Style 

Behaviors related primarily to the instructor's individual 
style and choice of techniques of presentation. These 
include level of enthusiasm for the subject and its presen- 
tation; animation; use of humor; speech characteristics; rate 
of presentation; use of visual aids; individual presentation 
techniques and traits. 

IV. Teacher-Student Rapport and Class Interaction 

Behaviors related more to affective components of in- 
structor and student interaction than to subject-oriented 
student participation. These include the instructor's 
approach to formality of class; social distance between 
teacher and student; permissive versus authoritarian style, 
personal interest in and involvement with students and their 
problems; personality characteristics to which students 



14 Paul H. Owen, "Some Dimensions of College Teaching: 
An Exploratory Study Using Critical Incidents and Factory Analysis 
of Student Ratings' 1 (unpublished Doctoral dissertation. University 
of Houston, 1967), p. 47. 
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react; control and discipline in class. 

V. Evaluation of Students 

. Behaviors related primarily to the processes surrounding 
the appraisal of students’ progress. These include the 
adequacy of defining test requirements; practices and 
procedures in grading; type, frequency and content of tests. 

VI. Requirements of Students 

Behaviors related primarily to what is expected of students 
but excluding those having to do with assignments to 
present material. These include adequacy of defining 
course requirements; responsibilities given to, demands ' 
made of outside assignments and expectations of students. 

Before noting the actual instrument items and discussing scaling technique, 

the validity issue related to expectations- evaluations instrumentation 

should be discussed. 

Even though the Michigan and Houston studies validated 
certain items, researchers implicitly assumed that virtually any item 
used on an evaluations instrument was self- validating. Often studies 
made little attempt to cite or report previously developed instructional 
categories or items, apparently assuming that the instructional areas 

were .adequately covered. Since most researchers tended to look 

* 

favorably upon student evaluations, the reluctance to explore in- 

♦ 

structional categories more thoroughly could be explained in terms 



. 15 See Richard R. Renner, M A Successful Rating Scale, " 
Improving College and University Teaching , 15 (Winter, 1967), 12-14/ 
and David A. Strand, l, A Rationale and Instrument for Student Evaluation 
of Classroom Teaching, 11 Journal of National Association of Women 
Deans and Counselors, 30 (February, 1966), 36-39. 
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offered by a student advocate of instructor ratings, that M there are 

1 6 

no bad student course and teacher evaluations . 11 More explicit, 

Remmers asserted that, "by definition, if one is concerned only with 

measuring the perception the students have of instructors, validity 

equals reliability." -1 ‘ In the handbook accompanying the Purdue 

Rating Scale for Instructors with which Remmers is associated, the 

%• 

instructions noted, "there is, of course, no easily available outside 

criterion for this. Since the students' attitudes and perceptions are at 

18 

issue, validity by definition can be equated with reliability." This 
would certainly be the case if the high degree of intercor relation of 
the Michigan and Houston items were acknowledged. Indeed, when, sepa- 
rate classes at Alpha College retook the current study's instruments, 
one after a 10 day and the other a 14 day interval, they produced on 
its seven-point scale reliability coefficients of . 89 and . 87 respectively. 

\ * 

^Philip R. Werdell, Course and Teacher Evaluation (2d ed. ; 
Washington, D. C. : National Student Association, 1966), p. 5, under- 
lined in the original. 

1*^H. H. Remmers, “On Students' Perceptions of Teachers' 
Effectiveness," The Appraisal of Teaching in Large Universities , ed. 

W. J. McKeachie (Ann Arbor: University of Michigan Press, 1959), 21. 

. * 

18 

H. H. Remmers, Manual of Instructions for the Purdue 
Rating Scale for Instruction (Rev. ed. ; Lafayette Indiana: Purdue 
University Book Store, 1960), p. 2. 
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Scaling Technique Employe d 

Developing an appropriate scaling technique for this instru- 
ment proved difficult. There was little consistency in scaling 
techniques for evaluation instruments,. Even when studies utilized 
items from earlier investigations, no uniform pattern of scaling pro- 
cedures emerged. Most investigators applied some form of the forced- 
choice technique with scales ranging from five to ten points; five being 
the most common. Yet, if one study adopted a nine-point sentence com- 
pletion technique with the opposite poles being some variation of, "I 
found the textbook in this course to be; l)Very dull" to M 9) Very 

interesting, " another used adjectives such as "probing" or "sympa- 

•20 

thetic" on an open-ended seven point scale. Some studies mixed 
quantitative and qualitative measures on one instrument. As an ex- 
ample, using a closed five-point scale for the item, "Class time is: 

usually a waste of time" to "always of much value," could produce 

21 

total responses that. mix quantity with quality measures. Similarly, 



1 9 

James N. McClelland, "The Effect of Student Evaluations 
of College Instruction Upon Subsequent Evaluations," California Journal 
of Educational Research , 31 (March, 1970), 89. 

9 

^Richard L. Turner, et al . , "How do Student Characteristics 
Affect Their Evaluations of Instructors," Indiana University School of 
Education Bulletin , 45 (July, 1969), 67. 

21 C. T. Stewart and L. F. Malpass, "Estimates of Achieve- 
ment and Ratings of Instructors," Journal of Educational Research , 59 
(April, *1966), 347. 
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should "poor" matches with "outstanding" for one item, also be com- 

22 

bined with "excellent" on another? It seemed clear that by comparison, 
if the Critical Incident and factor analytic techniques provided highly 
correlated instrument items, there would be a need for more uniform 
scaling techniques. 

One scaling technique already employed by two recent in- 

po 

vestigators was the Osgood Semantic Differential Technique, Its 

bipolar pairs of adjectives had a high degree of reliability and validity 

with college students and was therefore, useful for the items in the 
24 

current study. By employing a seven-point scale, moreover, the sem- 
antic differential permitted greater amplitude at the extremes, and 
provided the mid-point position as well. The only substantive criticism 
raised concerning the scale had been the usual one regarding the neu- 
trality of the mid-point. Yet it appeared, from recent evidence, that 



oo 

Ibid . See also Richard R. Renner, "A Successful Rating 
Scale," Imprcy' lg College and University Teaching, 15 (Winter, 1967), 

13. 



^Charles E. Osgood, George J. Suci, and Percy H. 
Tannenbaum, The Measurement of Meaning (Urbana, Illinois: University 
of Illinois Press, 1957) and James G. Snider and Charles E. Osgood 
(eds.). Semantic Differential Technique: A Sourcebook (Chicago, Aldine 
Publishing Company, 1969). 

24 

James F. Brinton, "Deriving an Attitude Scale from Semantic 
Differential Data," Public Opinion Quarterly , 25 (Summer, 1961), 289- 
295 and Lawrence A. Pervin, "Satisfaction and Perceived Self- 
Environment Similarity: A Semantic Differential Study of Student- College 
Interaction," Journal of Personality, 35 (December, 1967), 623-634. 
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25 

that this did not impair the usefulness of the semantic differential. 

Despite their dissimilar designs, the studies by Lewis and 

Rees presented promising' applications of the semantic differential to 

evaluations studies. Lewis used the bipolar adjectives to determine 

whether college professors were perceived in uniformly stereotypic 

terms, and found that different disciplines evoked significantly diff- 

26 

erent images in students. • Rees’s more elaborate investigation 

utilized the semantic differential on an evaluations instrument based 

27 

on the Michigan factor analysis studies. The. instrument was ad- 
ministered along with a number of standardized personality measurements 
to determine which characteristics might be associated with certain 
ratings. He used an ungrouped sample of 65 students from a sectarian 
university, and found that lower socioeconomic status students from 
stable homes presented the least favorable image of college instructors, 
viewing them as "generally unskilled in teaching ability." Despite 
the limited conclusion, the Rees study not only extended the use of the 

28 E. R. Oetting, "The Effect of Forcing Response On the 
Semantic Differential, " Educational and Psychologi cal Measurement , 

67 (Autumn, 1967), 699-702. 

■% 

^Lionel S. Lewis, "Students’ Images of Professors," ' 
Educational Forum , 32 (January, 1963), 189. 

27 Richard D. Rees, "Dimensions of Students Points of View 
in Rating College Teachers, " Journal of Educational Psycholog y, 60 
(December, 1969), 476-482. 

28 lbid. , p. 481. 
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sematic differential, but also represented the. most comprehensive 

attempt, since Riley, to integrate student input characteristics with 

29 

their images of instructors. 

After adopting the semantic differential technique for scaling 
purposes, the second problem was to select the appropriate bipolar 
adjectives for each item. Since the bipolar adjectives of the semantic 
differential had acquired widespread use and validation, this study 
selected only one bipolar pair per instrument item. With necessary 
modifications being made for the two instruments, both the expectations 
and the evaluations items used the same paired adjectives. Both in- 
struments are appended. The instrument items presented below for ex- 
pectations were classified by Critical Incident (hereafter Cl) categories. 
It should be noted that in all but the first category, which covered the 
widest range of behaviors, there were three items per group. Items 
were identified by their instrument part and number; the semantic differ 
ential paired adjectives are noted in parentheses: 

Cl Category I. --Presentation of Material-Content, Structure 

and Scope 

Part I, Item 3. What do you expect the instructor's knowl- 
edge of the subject to be? (Superior, Inferior) 

. * 

Part II, Item 1. What kind of understanding of the course 
objectives do you expect to have? (Hazy, Clear) 



^^John W. Riley, Jr., Bryce F. Ryan, and Marcia Lifshitz, 
The Student Looks At His Teacher (New Brunswick, New Jersey: 
Rutgers University Press, 1950). 
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Part II, Item 7. Disregarding the instructor and the way the 
course is taught, how would you rate the subject matter of 
the course? (Valuable, Worthless) 

Part II, Item 10. Considering everything, how would you 
expect to rate this course? . (Good, Bad) 

Cl Category II. --Presentation of Material: Student Participation 

Part I, Item 7. What kind of emphasis do you expect the in- 
structor to place on the stimulation of thinking and ideas ? 
(Heavy, Light) 

Part I, Item 8. What percentage of students do you expect to 

finish the required reading prior to class? (Small, Large) 

* * 

Part II, Item 6. How do you expect this course to relate to 
your own life? (Low, High) 

Cl Category III. --Presentation of Material: Instructor’s Style 

Part I, Item 2. Do you expect the instructor to make clear 
how each topic fits into the total course? (Actively, Passively) 

Part I, Item 6. Do you expect to find the instructor using 
enough examples and explanations to clarify the material? 
(Sufficient, Insufficient) 

Part I, Item 10. Considering everything, how would you 
recommend this instructor to a friend? (Bad, Good) 

Cl Category IV. --Teacher-Student Rapport and Class Interaction 

Part I, Item 1. If students request. help, in what manner do 
you expect the instructor to respond? (Pleasant, Unpleasant) 

Part I, Item 4. How do you expect the instructor to react 
to student questions, disagreements or expressions of their 
own ideas? (Approving, Disapproving) . . . 

.Part I, Item 5. To what extent do you expect the instructor to 
maintain student interest- in the course? (Low, High) 

Cl Category V. --Evaluation of Students 
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Part I, Item 9. What kind of emphasis do you expect the 
instructor to place on accumulating factual information? 
(Strong, Weak) 

Part II, Item 5. How do you expect to find the content of the 
tests and exams ? (Unfair, Fair) 

Part II, Item 8. In thinking about your grade, how do you 
expect it to reflect your true worth in the course? (Fair, 
Unfair) 

Cl Category VI. --Requirements, of Students 

Part II, Item 2. In relation to your other courses carrying 
equal credit, how do you expect the amount of study and prep- 
aration time to compare? (Just, Unjust) 

Part II, Item 3. How do you expect to find the content of the 
assigned reading? (Worthless, Valuable) 

Part III, Item 4. What kind of challenge do you expect this 
course to be for you? (Superior, Inferior) 

The twentieth item sought to ascertain student appraisal of both the ex- 
pectations and evaluations instruments. This measure, usually absent 
from evaluations instruments, read: How do you rate the ability of 
this survey to determine your own expectations / evaluations / for 

this course? (Strong, Weak). 

• \ 

Hypotheses and Questions Considered ' 



The major hypothesis confronted the notion that students re- 
mained relatively inert in weighing their actual classroom experience, 
that: . . 

• One , within any class there would be.no statistically significant 




difference in student expectations in the six Critical Incident categories 
and the subsequent student evaluations. 

Year in college was treated as a separate set of hypotheses 
in order to focus directly on the questions raised by Stern regarding the 
so-called ''Freshman Myth. " That: 

Two , freshmen would not exhibit statistically significant higher 
contextual expectations than other classmen in any one 'of the six Critical 
Incident categories relating to the instructor, and. 

Three , freshmen would not exhibit statistically significant 
higher evaluations compared with other classmen in any one of the six 
Critical Incident categories, and. 

Four , there would be no statistically significant difference be- 
ween freshmen expectations and evaluations compared with sophomores 
in any one of the six Critical Incident categories. 

Raising this question regarding the influence of year in college 
on expectations and evaluations led to a consideration of other student 
variables. Accordingly, the expectations schedule included items (see 
Appendix) reporting certain demographic information regarding the 
student population. The relative association of these variables was 
tested in the following hypotheses, that: 

.1 *>■ 

' Five , there would be no statistically significant difference in 
each of the Critical Incident expectations categories for these vari- 
ables: sex, type of secondary school attended, size of secondary 
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school, size of residence city, social science or non-social science 

major, previous instructor ratings experience, and whether tlie course 

was required or elective, and. 

Six , there would be no statistically significant difference in 

each of the Critical Incident evaluations categories for these variables: 

sex, type and size of secondary school attended, size of residence 

• • 

city, social science or non-social science major, previous instructor 
ratings experience, and whether the course was required or elective. 

This study also sought to examine the relationship of certain 
Student values with particular expectations or evaluations categories. 

So in addition to administering the expectations and evaluations instru- 
ments, the Allport- Vernon- Lindzey (AVL) Study of Values was taken by 

30 - 

the student sample. The recent Rokeach Value Study was not se- 
lected because two participating instructors mentioned introducing it 

31 

as an example in a class discussion during the Spring 1970. The 

AVL lent itself well to the current study for it amply demonstrated 

that the student value domain, especially for freshmen remained rel- 

32 

atively constant. Thus, the use of the AVL permitted expectations 
30 

Gordon W. Allport, Philip E. Vernon and Gardner * 
Lindzey, Study of Values (Boston: Houghton Mifflin Company, i960). 

^^Milton Rokeach, "The Role of Values in Public Opinion 
Research, " Public Opinion Quarterly , 32 (December, 1968), 548-559. 

^Walter T. Plant and Charles \V. Telford, "Changes in 
Personality for Groups Completing Different Amounts of College Over 
Two Years," Genetic Psychology Monographs, 74 (August, 1966), 3-36. 
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and evaluations to be compared with a presumably Stable variable. 

The AVL itself measured the relative strength of six values or in- 
terest modes for an individual, and the means for*groups indicate the 
relative strength of the values of the groups in question. To this point, 
would the values measured by the AVL (Theoretical, Economic, 
Aesthetic, Social, Political and Religious) produce similar or different 
expectations? For example, would students having high expectations 
for content and structure, as compared with teacher rapport, yield 
similar AVL values? In short, would the AVL values behave in- 
dependently of the various Critical Incident categories ? 
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CHAPTER IV 

• * 

PRESENTATION OF FINDINGS . 

The findings have been organised and discussed according to 
the hypotheses and questions posed by this study. Since the data will 

# * 4 

be presented in terms of the Critical Incident (Cl) categories for the 
study's expectations and evaluations instruments, they should be re- 
stated here: Category I — Presentation of Material: Content, Structure 
and Scope will be referred to as Content ; Category II--Presentation of ' 
Material: Student Participation will be referred to as Participation; 
•Category III- -Presentation of Material: Instructor's Style will be 
referred to as Style ; Category IV--Teacher-Student Rapport and Class 
Interaction will be referred to as Rapport; Category V-- Evaluation of 
Students will be referred to as Evaluation ; and, finally. Category VI — 
.Requirements of Students will be referred to as Requirements. . Like- 
wise, it should be mentioned that Alpha is a two-year women's college. 
Beta is a comprehensive community college, and Gamma is a fdur-year 
women's college recently turned co- educational. The four Alpha College 
classes will be cited as A, B, C and D; the .two Beta. College classes as 
E and F; and, the two Gamma College classes as G and II. 
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All analyses of reported data were carried out employing 
an IBM Systems 1130 Computer. Conventional statistical procedures 
were used for testing the hypotheses and discussing the questions ex- 
amined by this study. Statistically significant differences were accepted 
at the .05 level of judgment. For each of the hypotheses, two-tailed 
tests were used because the null hypotheses did not include a prediction 
of direction of change. The results of the study will now be presented 
in order of the stated hypotheses. 

Hypotheses Considered 

The first- -and major--hypothesis considered the changes 
in each sample class. To test the null hypothesis of no differences 

between expectations and evaluations for each Cl category, group means 

1 

were compared using the t test between correlated means. In each 

case degrees of freedom were determined by the-number-of-test pairs 

minus one. The relevant data can be found in Tables 1, 2 and 3. In 

Beta and Gamma Colleges significant differences were found in virtually 

all of the categories in each of the classes and college totals. Most of 

% 

the differences were at the .01 level. Alpha College classes reported 

• # 

some significant differences in three of the four classes. The College 

Jjohn H. Mueller, Karl F. Schuessler, Herbert L. Costner, ’ 
Statistical Reasoning in Sociology (2d ed. ; Boston: Houghton Mifflin and 
Company, 1970), p.,417. 
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-Means and Standard Deviations for Expectations and Evaluations Compared by Cl 
Category for the four classes in Alpha College and College total 
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total produced significant differences in four of' the six Cl categories. 

In all, 30 out of 48 expectations -evaluations Cl comparisons reached 
acceptable levels of statistical significance. In only 4 instances in 
Classes A and B did the mean differences for the evaluations out- 
weigh the expectations; of these one instance produced a significant 
difference in the Participation category of Class A. Thus, the null 
hypothesis was rejected for the alternative that differences were to be ' 
found between student contextual expectations and evaluations. 

An analysis of the ranks for the expectations and evaluations 
categories confirmed the alternative hypothesis. This analysis might 
help answer questions raised regarding the reliability of the study 

2 

population. Data reported in Table 4 indicated the rank order (rho) 
computations for the Cl expectations scales in each of the colleges. 

Not only did the data support previous ideal professor studies noting 
that social science students consider instructor Rapport, course Content 
and Style the more important course attributes, but the rank differ- 
ences between the colleges were negligible. Furthermore, the rela- 
tive constancy of the students’ expectations can be seen by comparing 
the rank order (rho) differences of their expectations and their subse- 
quent evaluations. Table 5 reported this data. Despite the varying group 

mean differences actually recorded by each class, the students ranked 

• . • ¥ 

2 

John H. Mueller, Karl F. Schuessler, Herbert L. Costner, 
Statistical Reasoning in So cio logy (2d ed.; Boston: 'Houghton* Mifflin 
Company, 1970), p. 273. 
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the six Cl categories in a rather consistent fashion. This seemed 
remarkable even if one were faced with the fact that the students' ex- 
pectations were meet with uneven outcomes. The.rho ratios for the Cl 
categories for Alpha College was . 93, for Beta . 76, and for Gamma . 94. 
Certainly this would serve as a confirmation of the- substantive hypothesis 

that expectations influence the way students record their evaluations of 

% 

instructors. 

This conclusion seemed to be supported by noting student 

* 

reactions to the expectations and evaluations instruments. Each instru- 
ment contained an item (Part II, Item 9) eliciting student response for the 
ability to determine their expectations and evaluations. This data by 
sex can be found in Table 6, with ajt test for correlated means computed 
for differences. The only difference actually noted indicated that the 
evaluations- instrument proved to be a better indicator of student response. 
Again, despite the significant differences between actual expectations 
and evaluations, student response to the study instruments remained 
relatively constant. 

To test the second hypothesis that there would be no difference 

between freshmen and other classmen group means for each Cl expectations 

• * 

category were computed and reported in Table 7. Because there were so 

few third, and fourth year students in each introductory social science 

• • . • • * 
course, the data were grouped as freshmen and sophomores. TheJ: test 
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t ratios significant at . 05 level. 



-Mean and Standard Deviation Comparisons of Freshmen with Sophomores by Cl Expectation 

Category within Alpha, Beta, and Gamma Colleges 
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for uncorrelated — or independent — means was used for determining 
statistically significant differences. Degrees of freedom were determined 
by summing the two test samples minus two. No differences were noted. 
The null hypothesis was therefore accepted. In this case the null hypothesis 
served as the substantive one. Clearly, the freshmen and sophomores re- 
ported similar contextual expectations. Even if one discounts the fact 

• • 

that students at the two-year colleges would be subject to an "upper- 
classmen" effect, no differences were reported by the four-year college 
sophomores. 

The third hypothesis that there would be no differences in 
freshman and sophomore evaluations was similarly tested and cited in 
Table 8. As can be readily seen, the only differences noted were for 
sophomores — and those were significantly greater. Although sophomore 
evaluations were significantly higher in nine instances, in all but one 
their evaluations were still higher than freshmen. Here again the null 
hypothesis was accepted, for when differences appeared sophomore 
evaluations — not freshmen- -produced them. Comment on this can be 
reserved following the consideration of the next hypothesis. 

Hypothesis four was tested on the point that there would be no 
significant differences between freshmen expectations and evaluations 

. . 3 John H. Mueller, Karl F. Schuessler, Herbert L. Costner, 
Statistical Reasoning in Sociology (2d ed. ; Boston: Houghton Mifflin 
Company, 1970), p. 407. 
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as compared to sophomores. The group means for this comparison by 

college can be found in Table 9* Here the t_ test for correlated means was 

employed. Since there were significant differences to the . 001 level for 

both freshman and sophomore evaluations the null hypothesis became 

untenable. This would confirm the substantive hypothesis of this study-- 

students record their classroom experiences in similar way's. Taking 

% 

Beta College, for example, it did not make much difference whether 

freshmen had lower expectations than sophomores, the actual experiences 

produced significantly lower evaluations for both groups. In summing 

up hypotheses three, four and five, both freshmen and sophomores in 

this study recorded expectations and evaluations that seemed to relate 

to esach other's actual experiences rather than their year in college. 

Because the fifth and sixth hypotheses dealt with the differences 

produced by certain variables in relation to expectation's and evaluations, 

% 

each variable will be discussed independently. Bach hypothesis stated 
that no differences would be exhibited in any Cl Category for either 
expectations or evaluations with the following demographic variables: 

type of secondary school attended, size of residence city, social science 

* 

or non-social science major, previous evaluations experience, elective or . 

. 9 

required status of the course, size of secondary school graduating class, 
and sex. Each of the seven variables was tested for expectations and 
evaluations by college employing an uncorrelated t_ test. For separate 
group variance between groups of unequal size, degrees of freedom were 
determined at the mid-point between the paired samples minus one for each 
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pair. The data appear in Tables 10 to 16. 

The comparisons for public with private secondary school 
graduates for each expectations and evaluations category is reported in 
Table 10. In only one instance- -Gamma College--did attendance at 
either public or private school show significantly different expectations. 
The finding for this variable would contribute to accepting the null hy- 
potheses- -that is, the substantive ones — of no differences between types 
of school attended. 

Table 11 presents the data comparing students’ residence city 
size for expectations and evaluations. The two categories of small and 
large city size, noted in the table, resulted in a bimodal distribution of 
responses to the four ranges presented in the expectations instrument 
(see Appendix A). Most students in the three colleges fell into either 
small suburban areas with populations of 50, 000 or fewer people, and 
medium to somewhat larger size cities of 50,000 to 250,000 people. The 
data was organized according to this distribution in order to develop a 
sample size that would enhance differences. Remarkably, no differences 
emerged — the only demographic variable to produce such results. This 
would confirm the null hypotheses, even though the population lacked 
students from large cities. 

The data for social science majors compared with non-social 
science majors appear in Table 12. As can.be readily seen. Beta College 
recorded the only statistically significant difference for an expectations 
category. This evidence would contribute to accepting the null hypotheses. 
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Table 13 discloses the data relating the influence of previous 
instructor rating experience on expectations and evaluations. Gamma 
College showed the only significant differences, one for expectations and 
the ocher for an evaluations category. This, too, would not lend any 
substantial evidence for rejecting the null hypotheses. 

The comparison of students taking the course for elective or 
required credit appears in Table 14. Gamma College. reported the only 
significant differences, with two each for the expectations and for the 
evaluations categories. Again, this evidence would be insufficient to 
reject the null hypotheses. 

Table 15 shows the data comparing students by their secondary 
school graduating class size. Because the responses to the four categories 
noted in the expectations instrument (see Appendix A) again produced a 
bimodal distribution of small- -less than 200 students in a Graduating 
class- -compared with large- -more than 200 in a class- -the data were 
organized around these categories. Alpha College presented no statistically 
significant differences. Beta College revealed opl-y one for an expectations 
category. 

The responses from Gamma Colfege, however, indicated 

♦ 

• 9 

that in four of the six evaluations categories, and one expectations 

category, significant differences were exhibited. The only other variable 

1 0 

• • 
to produce as many differences as secondary class size was sex, as is 

shown by an examination of Table 16 which reports the data for sex 

differences in Bela and Gamma Colleges- -the co- educational ones. 
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While Beta College showed no statistically significant differences when 

sex was compared by Cl category. Gamma College evidenced almost the 

same pattern for differences associated with sex as it did for secondary 

class size. In four of the six expectations categories and two evaluations 

categories, females (N=45) demonstrated statistically significant higher 

means than male students (N = 19). This evidence might appear to support 

% 

the notion that the four- year college students represented an incompatible 
population for comparisons with the two-year colleges, or that Gamma's 
newly arrived men added the unusual dimension for the two variables • 
under consideration. 

The significant differences in sex and class size can be ex- 
plained by examining the actual numbers of females and males in the 
large and small secondary school size groups in Table 15. Females 
outnumbered males in large secondary school by about only two to one 
(F=18; M = ll), but they outnumbered the males three to one (F=26; M : 9) 
in small secondary schools. Since the small school student expectations 
and evaluations group means sighificantly outweighed those of large school 
students, it followed that the sex difference data would show females dis- 
playing higher expectations than males. Although this evidence did not 
explain why the females from smaller secondary schools recorded higher 
expectations and evaluations, it was certainly made clear that the sex 
differences in this newly co- educational institution were not singularly 
responsible for differences in the variables noted by this study. It also 
confirmed what the other component variables for hypotheses five and six 
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demonstrated that the differences among this student population were 

generally slight in relation to the Cl expectations and evaluations 

categories. Taking the foregoing into account, it did not seem to be a 

large risk to accept the null hypotheses for secondary school size and the 

sex variables. It also seemed that with the small number of significant 

differences reported for the other component variables, there would not 

be too great a risk in accepting the null hypotheses here, too. Thus, 

• ) • 

it was accepted that variables associated with the student domain 
would not exhibit statistically significant differences in the Cl expec- 
tations and evaluations categories. 

Questions Considered 

The AVL Study of Values was introduced to determine how 

this student population compared with previous research findings on 

4 

the consistency of its six scales. It was also employed to learn whether 
certain of the Cl expectations or evaluations categories might be 
associated with particular AVL values. The AVL group mean scores 
for the pretest and posttest by sex and college year can be found in 
Table 17 and 18. A_t test for correlated means was computed to determine 
statistically significant differences. Overall the AVL values remained 
constant. Nonetheless, there were some significant specific changes, 
especially for freshmen; fifteen significant changes developed for 

4 They are Theoretical, Economic, Aesthetic, Social, Political 
and Religious. 
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TABLE 18. —Means and Standard Deviations for AVL Scale on Pretest and Posttest by Sex and Year 

in Gamma College 
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freshmen, and eight for sophomores. It seemed clear, however, that 
the general direction of these changes for both groups confirmed 
previous research. Change, that is, that tended to exhibit a decrease 
in the religious scale compared with an increase in the theoretical scale. 

Were there any significant relationships between the AVL 
values and either the Cl expectations or evaluations categories? The 
data by sex and college for expectations were reported in Table 19, and 
for evaluations in Table 20. In both cases a Pearson product-moment 
correlation coefficient ^ was computed for each of the items. No overall 
significant pattern of relationships emerged for either expectations or 
evaluations, however. The only pattern that developed concerned the 
male students at Gamma College, resulting in a pattern of inversion on 
the expectations scale. These students produced a significant positive 
correlation for the AVL political scale with an r of . 69 along with content 
(r , 01 = .. 58) and an r of . 46 for course requirements, (r .05=. 46) in 
comparison with a negative correlation for concern with the instructor’s 
style (r=-. 56). No similar development occurred for these students 
on the evaluations scale. These rather inconclusive findings pointed 
toward additional work needed to examine college student values and 
contextual expectations. , 

**John H. Mueller, Karl F. Schuessler, Herbert L. Costner, 
Statistical Reasoning in Sociology (2d ed. ; Boston: Houghton Mifflin 
Company, 1970), p. 319. 

•V 
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TABLE 19.— Intercorrelation Matrix of Pretest AVL Values with Cl Expectations Categories by 

Sex for Alpha, Beta and Gamma Colleges (Decomal omitted) 
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CHAPTER V 



DISCUSSION, IMPLICATIONS AND RECOMMENDATIONS 

This chapter is organized in three sections. The first 

» ’ 1 • 

section will discuss and summarize the study findings. The next section 
presents implications of these findings. And the last section will make 
recommendations for related future studies. An afterword follows. 

Discussion of Findings 



This study examined student contextual expectations in re- 
lation to their evaluations in the natural setting of the classroom. The 
findings generally supported the substantive hypotheses, namely, that 
student evaluations were closely associated with their, expectations of 
instructors. This was affirmed in the analyses of each class and college 
group means as well as rank order comparisons for the Cl expectations 
and evaluations categories. More importantly, the findings demon- 
strated that statistically significant differences emerged between* 
expectations and evaluations, even for those classes where the actual 

evaluations would probably qualify as being reasonably high. In such 

♦ 

instances, as for Classes A and C in Table 1, significant differences 
occurred in half of the categories. If evaluations were the only measure 



for those classes in Tables 2 and 3, the lower expectations, in comparison 
to stilllower--and significant- -evaluations would surely escape notice. 

So, despite the fact that the students in all the social science classes 
under study exhibited similar directions for contextual expectations 
and evaluations, the actual intensity of these factors varied. In other 
words, making extrapolations from evaluations about the actual nature 
of student-teacher classroom interaction could be of questionable value. 

This study also tried to relate the influence of yearjin college 

* 

to- expectations and evaluations. Here the data substantially indicated that 
freshmen and sophomores exhibited similar contextual expectations in 
.all three colleges. Regarding evaluations, it was the sophomores, not 
the freshmen who consistently revealed higher evaluations- -and in some 
instances statistically significant ones. This would certainly question 
the applicability of the findings relating situational expectations to 
specific classroom contexts. If a ’’Freshman Myth" existed for this 
student population, then it applied to sophomores, too. This was con- 
firmed when freshman and sophomore expectations and evaluations were 
compared as reported in Table 9. Nonetheless, as the subsequent con- 
sideration of the demographic variables in Tables 10 to .16 bore out, 

. * 

the student populations might have been too in-group homogeneous to 

reveal year in college differences. This criticism would not, of course, 

negate the fact that within these specific classroom contexts the 
* # ** • 
differences played by year in college were minimal. 
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Although only intended to be suggestive, consideration of the 
interrelationships between the Cl expectations and evaluations categories 
with certain demographic variables, disclosed some pertinent information 
regarding the three colleges. In order to track down why the only 
significant differences developed for Gamma College were in sex and ' 
secondary class size, it was discovered that the otherwise lack in 
differences for expectations and evaluations might be explained by the 
relative homogeneity of the groups. Nevertheless, it appeared that 
the findings of previous evaluations studies, which noted few substantial 
differences for comparable demographic variables, were confirmed by 
this study's student population. The data also revealed that these 
variables failed to produce significant differences for expectations. 

Indeed, it seemed that greater discrepancies were exhibited with evalua- 
tions categories (see Table 15). 

Another area introduced into the study for suggestive purposes 

sought to relate the Cl expectations and evaluations categories with the 

six AVL value scales. Although the AVL was confirmed as to its test 

and retest reliability as indicated in Table 1J, it did not produce any 

♦ 

overall intercorrelation pattern. The. one significant finding pointed to 
the need for further work relating contextual expectations with respect 
to student values. This result, that males at Gamma College who ex- 
hibited positive expectations for course content and requirements in 
relation to the AVL political scale also revealed negative expectations for 
concern with the instructor's style would provide a likely starting point. 
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If this study found that students in the three colleges responded con- 
sistently on the test and posttest of the Cl expectations and evaluations 
instruments and the AVL value scales, then it would appear desirable to 
assume that further work would discover the relationships between 
expectations and values. A key question, left unanswered by the find- 
ings of this study, was: would the homogeneity that expressed itself 
in a variety of ways here,' emerge in future studies of expectations and 
values ? 



Implications 

This study began as an inquiry into the nature of instructional 
evaluations- -did its findings reveal any implications for currently 
accepted evaluations procedures? If evaluations mean a reaction to the 
instructional process, then the end-of-cours-e rating may still serve 
a valuable function. For as this study confirmed- -in keeping with 
previous evaluations research--an end-of- course rating would be a 
reliable indicator of student reaction. But if closer student-teacher 
interaction becomes desirable, then, the appropriateness of expectation 
inputs would be obvious. Simply, presently conceived end-of- course 
ratings have built-in limitations.' In addition, these evaluations tend 
to create the false impression that the students in any one class remain 

inert, especially if an instructor receives similar ratings over time. 

> ‘ • 

To sum up, even for the three relatively homogeneous student groups. 




the expectations dimension should have added to an understanding of 
teacher- student interaction, or the lack of it. 

The problem of interactiqn raised a question concerning 
the consensual expectations model. The approach called for greater 
participation by student and teacher in defining classroom objectives; 
its corollary would be the increased awareness of each other's expec- 
tations. It might be argued that in order to establish the reasonableness 
of this model, the study had to demonstrate the obvious. That is, 
student evaluations would be different from their expectations. If 
this was so, it resulted from the fact that end-of- course ratings failed 
to include the intensity of student expectations. Moreover, it seemed 
that student expectations continued to play a limited role in college 
classrooms. It became necessary, therefore, to examine whether 
significant differences actually developed in classrooms. Based on this 
study findings, the consensual approach would discourage any evaluation 
cast in isolation from expectations. 

The implications for the study of classroom-- contextual- - 

expectations in relation to institutional- - situational- -expectations were 

•% 

most suggestive. There seemed to be differences in the role played 

. • 

by each set of expectations. For one, year in college- -prominent in 
situational research- -did not prove a significant factor in. determining 
either expectations or evaluations. If the "Freshman. Myth" offered 
by Stern meant that frcshme.i would be more disenchanted with their 
situational expectations', this study advanced that freshman and sophomore 



104 



contextual expectations and evaluations remained .relatively similar. 

These findings suggested the-need for clarification between the sets 
of expectations. As part of that study agenda, it would be important 
to learn how students report each set of expectations. Are the differ- 
ences to be found in the perception of each set? The findings of this 
study— in keeping with Stogdill's framework- -indicated that the role 
and performance aspects of a classroom would clearly account for some 
of the possible differences in reporting contextual expectations. In 
sum, the dynamics of classroom membership compared with in- 
stitutional membership have still to be studied. 

Recommendations for Further Stud y 

This study was limited by the in-group homogeneity of the 
three student groups, and it might be appropriate to replicate the 
design in other multi- institutional settings with a greater cross-section 
of students. Since most evaluations research has taken place in state 
colleges and universities, it would be advantageous to introduce the 
expectations dimension into those settings, too. If single institutions 

must be used, then the sample group should be heterogeneous. Not 

, # 

only would this provide an increased awareness of classroom dynamics, but 
it might also encourage receptivity to greater student-teacher interaction. 
One recommended measure would be the Rokeach Value Scale. If this 
Contributed to understanding the nature of student values, a mors 
comprehensive understanding of student expectations would be in order. 
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too. Here, additional work in the area of instrumentation seemed 
appropriate. Although the present study suggested Osgood's Semantic 
Differential Technique to overcome the increased proliferation of in- 
struments, perhaps other scales could be correlated with it. Any future 
refinements should work to validate expectations- evaluations instruments. 
All in all, more work needs to be done relating the nature and origin of 

- i 

values with expectations. 

Improvements in the study design could be suggested, too. 

One would be to introduce certain controls into the experimental design. 
For example, it could prove worthwhile to manipulate the classroom 
situation-- the independent variable--by employing the concept of con- 
sensual expectations in some groups only. Although many factors would 
require control, it would be important to learn the effects of this approach 
in actual classrooms. This type of experiment could be conducted in 
larger institutions with relative ease. In small colleges further con- 
trols would have to be introduced through a multi- institutional design. 
Finally, further studies might consider the intervening 

variables of class size and other subject areas in relation to the ex- 

* 

pectations- evaluations approach suggested here. Long-range longitudinal 

studies would be needed to determine the impact of actually employing 

this approach on both students and teachers. This type of study would 
• ( . 
help to understand whether consensual expectations could become an 

acceptable classroom, approach. 



